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(71) We, NCR CORPORATION, for- 
medy The Natiaaal Cash Register Cosnpany 
of Dayton in the State of CAuo, and Balti^ 
the State of Maryland, United States 
of America, a coipoiratiosi oi^ganized under 
the laws of the Stale of Maryland, Unked 
States of America, do hereby declare the in- 
vention, for ^lidi we pray that a potest may 
be granted to us, and the method by which 
k is to be perf ormed, to be parricukd-y de- 
scribed in and by the foUbwing statement: — 

This invention rdaies to data processing 



Accoiding to the present invendon, there 
is provided a data processing system when 
candit^oned by porograming means which m- 
dudes translarion means arranged to cause the 
data processing system to translate a source 
pro gy^ J n ai high Ievel_soiaDCLlanguage^ia^ 

said^S^^^^^^^^^^mi^^^^inToper- 
atioxir to separate executable statements (as 
herein deGned) from nosi-execinable infoima- 
tion (as herein defined). 

It will be apprediated that a data prcxxss- 
ing system aorording to tis immediately pre- 
ceding paragra]^ has the advamage that ^ikc 
|teiSiOiBe^^d]^^^3^ 



^t^^jjjte|^m5g^ 

<?^j^f^^i ^g^i<y mpi^ 
mj^pll^fra^^ 

^n^^a&^^y^S^c&xxsd herein that a "higji 
level language^' is a progranmimg language 
wherein each instructioa or statement gener- 
ally corresponds to a phuaHty of machme 
code instinicticms. Ezampls of hig^ level lan- 
guages are ALGOL, FORTRAN, COBOL 
and PL/1. A "low level language" is a pro^ 
gramming language wherein eadi instruction 
generally conespands to a sipgle machine code 
instmcdoa. The process of converting a pro- 
gram from a hagh level language into a low 
levd knguagp is generally referred to as "com- 



piling", and a program which effects such 

a conversioD is generally referred to as a 50 

"conqjiler". 

It should also be noted that an "execut- 
aUe statement" herein is meant a program 
statement ^ch specifies an operation to be 
performed on data. By "noa-executable in- 55 
fonnatton" is meant information which does 
not specify an operatL(»i to be performed^ on 
data; for exam|de infomadon q)ecifyiiig the 
cbaracterisdcs of data. 

One embodiment of the invendon will now 60 
be described by way of example with refer- 
enoe to the accompanying drawings^, in 
wiuch:— 

Fig. 1 is a block diagram blowing the 
various stages involved in a computer system 65 
using a generalized compiler; 

Fig. 2 shows a block diagram of a specific 
embodimenc wherein any of a number of 
commonly used source languages and any one 
of a dass of computers are provided with 70 
lartgiia gf tran^atoxs mglring ihem adaptable 
for use with the generalized compiler; 

Fig. 3 is a schematic dran^nng of the 
sequential steps used in making a source pro- 
gram available for a specific host computer; 75 

Fig. 4 is a schematic showmg how the 
various mamfestadons of the source program 
are acted vcpon by elements of the computer 
system in order to transform them into ad- 
vanced stages or conditions for final use of 80 
the host computer; 

Fig. 5 is a schematic drawing layixig out 
the structure of an "exeaxtable" statement 
in the format of a First Intennediate Lan- 
guage; 85 

F^ 6 and 7 are exemplary tree schematics 
illustmtmg informarion heir^xhies in a new 
Metalanguage; 

Fig. 8 is the organizationat diart showing 
how the set of the major mformadon struc- 90 
tures caned:"gr^p;'-7is^a^ivi<ted 
subsets each of \duch are sucojssively'furtlM 
subdivided into twoi analler sid>sets^ 

Fig. 9 is an organization chaxt shbwing how 
the set of all elements of information, e^ is 95 
subdivided into two major subsets and how 
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Fig. 10 is ^^illustration of the use erf the 
new Metalanguage to specify one type of IF 
statement. The inf onnation unit^ Eln, is ^own 
expanded into succ^sivdy lower levels of 
complexity of information. 

Figs. 11 tfanH^ 14 arc drawings of vari- 
ous configurations of computer systems in- 
corpofrating the generaliasd compiler concq>t. 



Descripiion 
The large scale integration (LSI) phaio- 
menon in the semiconductor tedmology is 
having far reaching impact on the entire in- 

15 formation pitxsssing industty. The avad- 
ability of inexpensive and rdiable hardware 
building blocks containing complex lo^c cir- 
cuits opens realistic avenues for the develop- 
ment of more efficient computing systems. 

20 The ever iiKsieaang cost and complexity of 
the computer software on the other hand, 
necessitates a new look to the ccsnputer sys- 
tem architecture and hardware-software inter- 
relationship. 

25 Program cwnpilatioo ofteo requires more 
computer time tiian executicMi of the program. 
A significant amount of computer power is 
spent for the trandation of higji IctcI pro- 
gramming langwses into marhinc language. 

30 Compilers have mcreased in size frran 10^0 
K words for FORTRAN or COBOL \xp to 
and over 195 K words for one ver^on of 
FORTRAN. Hours <tf expensive computer 
time are consumed by the compiler. Other 

35 ineffidences are caused by such tedious opera- 
tions as compiler and prc^ram debug^ng ami 
mamtenance. In multiprogramming and time- 
sharing systemsfi ccnnpilipg causes sigmficant 
system overhead due to increased program 

40 swapping. Computers require laig^' main 
memories because of the coie requirements <rf 
compDeis. 

Generally, compiler costs are an exponen- 
tial functicm of the size of the compilear tadc 

45 which is incorporated into cHie integrated unit 
Further growth in the complexity of the 
source language is severely linrn't prf by the 
incrrasing size and cost of the compiler using 
current cmnpiler techniques. By dividing the 

50 compiler tasks into independent isc^ted 
mothiles which separate the user problem 
transformation task from the language syntax 
and semantics task, these limitations can be 
reduced and the compiling task Amplified. 

55 Compilers do not attempt to separate lan- 
guage translaticm tasks from problem trans- 
formation and consequently any change in 
either the source lai^uage or machine code 
can cause changes througjiout the campiler. 

60 Source languages and computer hardware are 
in a constant state of flus and as a result a 
new cranpiler must be written for p^rh stwrce 
language each time a new computer is de- 
signed. Much of this work can be avoided by 



recognizuQg and taking adviantage of the fact 
that compiler tasks can be separated mto two 
independent classifications: language transla- 
ticm and problem transformation. By keqpdng 
these groins of tasks separated in the com- 
piler, the impact of a change in the source 
language or a changq in the computer hard- 
ware can be limited to the TgngiiflgR trandation 
porticHi of the campiler. The part of the com- 
piler concemed with the transftmnation of 
the user's pooblem need not be changed be- 
cause chflnffl'ng the language used to state the 
problem does not change the problem. 

The generalized compiler herein is based 
on clearly separating "language traislation'' 
from **i»oblem transf oimation*'. If a source 
program is defined as the oHnbination of a 
problein algorithm and the exposition of that 
algorithm in some source language, then^ the 
problem transf oitmation consists of the map- 
ping of the problem algoritinn from a com- 
plex function-oriented format into a Ampler 
opsration sequence format Language tians^ 
lation consists of chai^;ing the fomiat of the 

ingj^SS^m^^llangj^e^^ 

Then the problem^ in intermediate lan- 
guage;, is transformed from a set of procedures 
and algeforaic expressirais into a secpience of 
computer operations. Finally a second lan- 
guage transformation is performed to convert 
die above machin&'independent output into 
operation codes for a ^:^ci£c computer. By 
designing the compiler as three independent 
modules^ the impact of a change is linuted. 
Thus> a chai^ in source lai^^uage only affects 
the first module, the source lai^uage trans- 
lator. Likewise the introduction St a new 
ccHnputer with a new machine code instruc- 
tion set simply requires the change or replace- 
ment of the second langiTagp translator. In 
dther case> the problem transfonnation por- 
tion of the compiler and one language trans- 
lator remains unchanged. If a new computer 
using a new implementati(Hi language is intro- 
duced, the compiler may have to be trans- 
lated into the new implementaticm language. 
Even this task is ccMisiderably smalW rfian 
traiKlating a conventional compiler. 

A large part of the con:^>iler is in inter- 
mediate language which is independent of 
both the source -language and. die ^)edfic 
computer used for program executicm. Cbn- 
sequentiy, it may be feasible to implement 
this part of the campiler in hardware yiTtrp it 
is reasonably insensitive to chanrges. 

Intermediate Language 

In order to separate the compiler tasks into 
indq>endent modules and thereby reduce the 
complexity, problem transformation is separ- 
ated from language translation. To do this, it 
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is necessary to be able to* state tAe problem 
in a language whidi is ii]dq>endent of the 
source language. This is the pinpose of hav- 
ing an imenncdiate language (IL). This also 
defines the natuie of the IL. The EL language 
can be divided into two ind^>eadeQt paiits: 
nonexecutatde and executable. The nonexe- 
cutable part essentially specifies the diaiac- 
isttcs of die data and die executable part 
specifies the fanctioos to be perf onned m 
the data. 

All nonexecutable souxce program informa- 
tion is passed to the genetalized confer by 
means of tables i^ch are collectiveiy called 
a symbol table. Eadi ideodfier in the source 
program occupies one symbcd tal^e word. 
Various attributes of this idmdfier are coded 
into the proper bit positioas in ibis word 
during the language translatioa of the source 
langcage. The "identffiers" are nounal items 
sudi as A> Q or X» Y, Z which identify 
the subject matter in a given algorithm. For 
example: p^=hou]:s^wo(rked x houriy-rate 
or C=A X B; Thus A, B, Q pay, hours- 
wtxrked, and hourly-rate arc idmtifiers. 

The executable part spediSes the functions 
or operatioDs reqmred and the sequence m 
which they are to be perfonned. The same 
function may be specified m a. different way 
in each soinre language. However, the XL 
need provide only one way of spedfyit^ a 
q>0cific function. The total number of fimc- 
tions described in IL will be greater than 
the number incorporated into any one source 
lai^uage ^nce the various source languages do 
not all GQCon^yass the same set of funcdcMis. 
likewise;, the IL requires a larger set of 
operators dian any single source language. 

In the practical jmpjem enta rion herem pro- 
posed, 




'8J^*i!iFiS'Sfi6lishe£t it can be expanded 
by the later addidon of more functions and 
more opexstors without die complications such 
changes would cause in axrrent compilers. In 
facti dtkldng of. source language in terms of 
new problem functions may be a great aid 
in the more orderiy devdofxnent of a more 
powerful, ample to use^ gwieralized source 
languagje. In addition, die sq>aratioa of data 
attributes is compatible with die concepts of a 
comman data base. 



Source Lemguage Translator 

In order to separate the problem transfor- 
mation task from the language tr^nsladoo, the 
problem must be stated in a manner that is 
60 source language independent Tliis is the task 
of the swirce language translator. The source 
lan g uage trandator (SLT) strips away the 
excess verbiage, redundancy, and ideosyncra- 
des frcm the source language. (One SLT is 



required, fori each source language.) It^ recog-^i* 65 , 
nizes source language qperator coides and key- 
words and tran^ates ti»^ as well as execut- 
able prooeduxes and algebraic expressions into 
intennediate language (IL — 1 format). It 
places all names and declared attributes of 70 
the data and the environment into the symbol 
table replacing the names (operands) in die 
executable program with symbol table 
addresses. The IL~1 ii^put format for expres- 
sion may consist of alienating operands and 75 
operators. If an operand is not present be^ 
tween two operators in a source language 
expression, die SLT supplies a ^>ecial "no 
operand" code. This is one method of dimin- 
inadng a diq>licattoin of language translation 80 
widiin the generally comp^. 

Source Imiguages widi a precedence algo- 
rithm different from the IL precedence ^algo- - 
rithm aire translated by the addition of paren- 
theses by the SLT. 85 

Generalized Compiler 

A Problem Transformation Module (Gener- 
alized CcHnpilo', GC) is used to transform the 
executable procedures and expressions (which 
are in a standard IL — 1 fcnmat indq)endent 90 
of the source langirage) from a problem or 
funcdon oriented foim into a computer opera- 
tion form condsdng of a sequence of machine 
level instructions perfonned on elementary 
variaHes. The GC reorders the sequence of the 95 
operations of an expression according to a 
commonly used precedence algorithm. In 
addition to the expression processor routine, 
the GC contains routines for the transfoocmaT 
tion of all other executable statements. Apf^o- 100 
priate code sequences are generated to trans- 
fer control to functions and sub-routines and 
to return as wdl as to test DO loop conditions 
and to increment to DO loop variables. - -// 

Mode Code Translator 105 

The Machine Code Trandator (MCT) is a 
simple language translator to translate the 
output operation codes of the GC into the 
machine codes of a ^>ecific computer. It 
combines symbc^ table infonnadon and mach- 110 
ine code skeletons with the output (operand- 
opaator-operand) triads the GC 

System Organization 

Fig. 1 shows the system organization iidiere- 
by die Source Program (in a high-level Ian- 115 
guagc such as FORTRAN, COBOL, etc.) is 
hairfled by ^ a* Source Language Translator 
10 (SLT) separating executable statements 
from non-executable information and is con- 
verted to a Fkst Intermediate Language 120 
(IL— 1). The data firam the SLT 10 is dien 
handled by the Generalized Compiler 11 
(GC) whidi converts the statements into a 
Second Intermediate Language (IL— 2) which 
is then handled by the MCT 14 in order to 125 
convert the program information into a speci- 
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The GC 11 provides ±c functioo of "prob- 
lem transfomatum" with a Problem Trans- 
5 fonnadon Prc^ram 12 and the function of 
symbol tran^tioa with Symbol Table 13. 

Fig. 2 shows a more q)eci£ic system ^^lere 
each individoal high levd Source Lmigaagie, 
as C, F, or A, is handled by spcdfbc Source 
10 Language Trmslators 10c, lOf, 10a, etc., to 
convert infoomation into IL — 1. Then GC 11 
provides the program infonnadon in IL — 2 
to specific Machine Code Translatotrs as 14^^ 
or 14ni2, or 14n,„ dqjending on the ^)edfic 
15 make of the host computer. 

Sjyurce Language Translator Tasks 

The four principal t^^s.pf the. Source Lan- 
|guage Translator aie: 

1) Identify all source language symbols. 
20 I 2) Create a symbol tabk dt all data attri- 
butes. 

3) Translate all executable source language 
statements into intennediate language. 
fA) Detect source language syntax errors. 

25 All source language symbols must be recog- 
nized. The source language character set is 
trandated as necessary to confann to the IL 
code format. Each delimiter (^mbd ¥^ch 
communicates ^'separation" of items of infor- 

30 matiom for control purposes) is replaced by its 
equivalent IL code. Each idendher is added 
to the symbd table u^ess it is already listed. 
In either case, each idemifier is replaced by 
its symbol taHe address in all executable 

35 statements. 

In terms of seqi^nces, Flgp 3 illustzates 
the conditions (a), (b), (c), (d), wheae 

the Source PrDgram. _in a lE^ Level Lan- 
guage, is converted & a Imsermediatc 

<0 Lai^;uage, dien to a Seaxzd Intermediate 
Language, and finally to being the Source 
Program in Object Machine Code \^^iere it is 
usuable for the host computer. 
Fig. 4 illustrates the points of impdi>gement 

45 of: the SLT 10 Prc^^am mi the Source ^o- 
gram (a) to convert it to cooditioa (b); the 
GC 11 program which converts the Source 
Program from Hrst Intennediate Language 
condition (b) to Second Ihtiennediate Lan- 

50 guage conditicm (c); the MCT 14 program 
whidi converts the Source Ptc^ram in con- 
dition (c) from SoroiKi Intermediate Language 
^.t»^.0(mditi<m .(d) into .Object Machine . Code. 
Symbols in a Source Language may be 

55 categorized to include those shown in the 
following Table 1. 

(A) Ddindter 

1. Operator 

a. Primitive (+, -, *> .GE, etc) 
60 b. Keyword Executable Command 

(D. GO TO, MOVE, etc) 

2. Punctuation (separatiwi, control) 



(B) Fd^Oifter 

1. Keyword: Program structure name 
(procedure, Wod^ divisicaia sectiOTt, etc) 65 

2. Statement Labd 

3. Variable (includes constants, literals) 

4. Function 

5. Subscript 

6. Aigmnent 70 



7. 



ion 



(C) Comment 

(D) Irsxdid Symbd or Keyword 

AU the data declaration and envircmment 
description infonnatioa is placed into the Sym" 75 
boi Table 13. Comments are detoed. The 
symbol table takes the place of all nonexecut- 
able statonents when the source program is 
in intermediate language. 

Each executable source lang'^^g'^ ytgmj-np nr 80 
is translated into intermediate langimga 
EsseutiaUy this consists eliminarii^ & re- 
dundancy and ideosyncracies ami reordering 
the inf onnation into a language independent 
format Key words are replao^ by the eqiu- 85 
valent IL code which always occurs as the 
first word of a statement. Arguments and 
parameters appear in a specified order separ- 
ated by commas or some other deHmiter but 
without the memory and readability aids 90 
found in source langimges. 

An "end of expression" code is added to 
the end of ea ch statement function expression 
and the name oi the expresaoa is stored in 
a special area in the syinbol table 13 so that 95 
the symbol table addiess which replaces the 
name m the Somce Prc^ram is identifiable 
as an expression nflmp? Statement functions 
(FORTRAN) and similar souroe language 
deric^ specify the solution of an equation IQQ 
each tune the name of the expression appears^ 

The IL code for "NO OPERAND" is 
inserted in between every pair of operators 
in the Source Program nduch are not separ- 
ated by an qperand as necessary to avoid 105 
diqdicating syntax lecogintiQa m the geoerat- 
iz^ compiler. 

Eadi source language statement transiated 
into intermediate lan^iagfi is consecudvdy 
numbered. Invalid diaractm are noted and 110 
converted into blanks. Error rtnitin^ are m- 
v(^ed for invalid keywords aiKi odier syntax 
errors. Parentheses are added as needed to 
. convert the precedence implied. in .the source, v.. . ^ 
language into the precedence conventions of 115 
the intermediate language; in case a difference 



Source language syntax errors such as 
illegal characters, ddimftprs, keywords;, and 
identifiers are detected. 120 

Any of the currmdy used techniques for 
recognizing and transilating source lar^age 
can be adapted for this computer system. 
Syntax-directed compiling and t^e driven 



1,367,741 



5 



^ ' ' omipilxng are; the most* common tfedmiques 
used. The characteiisdc of the newiy inTcmed 
system are: 

1. The translati(Mx is much simpler because 
5 it is a transl^oQ into another high 

level language rather tiian a translatioii 
into mahhiTm code. 

2. The impact of a change ia the soisce 
language is Itmitrd to a small pait of 

10 the compiler system rather than affect- 
ing the entire computer system. 

Problem Trmssformadon Tcssks 

The principal tasks of the proiiiem. trans- 
foimaiioQ module 12 arc to: 

15 1) Transfoxm problem funcdcms into com- 
puter operaticm sequences. 

2) Opdmize the object code generated. 

3) Detect errors. 

The executaMe statements are transf oaned 
20 from hig^ level fimcdoQ-oriented format into 
computer beware (^)eratioa-oriented code 
and by rearranging the sequence in \^ich the 
operations appear according to operator pre- 
cedence rules of an iittennediate language 
25 created as psrt of the system. The btcraie- 
diate language operates sudi that: 

(a) the keywoni in each statement directs the 
problem transfonnadon oHitrol t3o the rou- 
tine required to transfoim that statement 

SO For many statements^ a direct transfonna- 
tion can be made u^ng a ccnie ^eieton 
and substitutmg arguments from the state- 
ment for dummy variables. 

(b) an expression proc^sor is used to trans- 
35 foim expressions within the statement. This 

processor uses one or more pu^-down 
stadcs to aid in reordering expressicHis to 
meet the precedence criteria established. 
A fm^-down stack is also used to permit 

40 the use of recursion within esprcssaons. 
Eadi time a triad (two operands and an 
operatcH*) encounter an operator of lesser 
precedence^ die triad is removed from the 
expression as EL output code. The triad is 

45 replaced in the expression by the symbol 
table address which ^)edfies y/hero the 
triad ouq>ut code operaticHi result is stored. 

The d>ject code is optimized by efforts 
, , . .vSudi as the .elimination of the repeated execu- 
50 tion of the solution <5f We s^^ 

whenever the values have not charged in the 
interim. The assignment of index registers to 
expedite address cwnputation is another opti- 
mization technique If the number of registers 
55 is limited, optimization will dictate that the 
innennost of a DO loop receive pre- 
ference over outer loops. Conflicts may also 
occur between local and gM>al optimization. 
Ettot detection includes finding illegal 



sequences of opmnds and o^eratorr ^ch the ' 60 
expressi(m i^ocessor will detect. In addition, 
the varic»2s keyword statement routines will 
detect missing argument for which no alter- 
natives are specified and for vfbidx default 
values are not provided. 65 

Second Lmguage Translator Tasks 

The principal Second Langixage Tr^ator 
tasks are: 

1) Translate the problem transfomiatioQ out- 
put (triads) into madune code for a ^^edfic 70 
ingplii'ni> uang code skdetcms and symbol 
table infonnation. This is accomplished by 

die Machine Code Translator 14 of Fig. 1. 

2) Decide data type dominance for operations 

oa operands of two different data types. 75 
Generate cdde needed to convert data type. 

3) Assign storage locations to all o^ect code 
identifiers in the symbol table. Replace the 
symbcd table addresses of these identifiers 
with the assigned storage locations. 80 

The purpose of the second l anguage trans- 
lator is to convert the machine-level machine- 
independent proUem transformation output 
triacls and otirci machme-indqpendent instruc- 
tions into instructions (primitive code) for a B5 
specific machine. The im|»)rtant point to hasp 
in mind is that the only impact necessary on 
a compiler due to a machine code dhange is 
a language translation. Therefore, the com- 
piler should be designed so that parts of the 90 
confer not involved in machme code 
generatson are independent of the features of 
a specific machine. This wiH insure that the 
impact of a change to a new machine will 
be stricdy limited to the code generation 95 
skdetonsw It is the job the code generator 
to provide the object code necessary to exe- 
cute the pit^ram instructions on a ^^fic 
computer. To do tius^ the code generator 
uses the combined infoimation of the output 100 
of the Problem Transfonnation Module Ex- 
pression Processor, the information m the 
symbol tables, and the code ^detons. 

The code skeleton sdected depends not onlv 
<m the particular machine instnictioQ as spod- 105 
fied by the operator (symbols used to repie- 
seat the "action" to be done on the ideairifiere, 
or operands) but also on die attributes of the 
data involved. The attributes of the data axe 
specified or imj^ed for each operand in the 110 
symbol table at the , sddiess wlu(^ 
vatiie' " of thfe " operand. Eadi * symbol table 
attribute field may be coiosidered to be part 
of a micro-program instruction needed for a 
particular comHnation of attributes. 115 

Every data type has a predefined domin- 
ance rdationdiip in liberations on two opcr- 
ands. ITmsi, for example, a multij^y operation 
involving a fixed-point and a floating-point 
number requires machine code instructions to 120 
convert the data type of one of the numbers 
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In this case, floatnig-point dommates fixed- 
point and oc»isoquaidy> the fixed-pomt num- 
ber is cofoverted into floatmg-^Kiint After the 
5 multii^cadoai, the answer is converted as 
needed to matcfi the t^^ qxecified for the 
answer. 

Hie final task of die Second Language 
Translator is to replace each symbol table 

10 address in the object code with the storage 
addr^ allocated to it and listed in the sym- 
Ixl table; This task is done after the program 
is in the machine code for a spedBc target 
computer inasmuch as it is poss^ that the 

15 translatioa to a qjedfic machine code may 
not be one^to-on& If an additional instruction 
were rcquired to be inserted after storage 
aUqcatiqa^ each reference to aisy address be- 
yond the point of insertion woi^ have to be 

20 located and changed. SiKih a ta^ would be 
far too difficult to praoticaL 

Descriptim of the Intermediate Lmeumes 
(IL) 

Func&md Organisation: The function of 
25 IL is to describe the user program interfaces 
with the Generalized CcHnf^er. It has f onnats 
that are ind^randent of terth the source lan- 
guage and the computer on which the user 
program is to be executed. User p i-ogi am 
30 statements fall into two categcmes: 

1) Nonexecutable infonnation 

2) ^ecutaHe statements 
Nonexecutable statements are teed for ncm- 

changing information such as data attributes, 
35 £Ue attributes, storage aHocaticov and control 
infoimation. In IL, tins infonnaticm is in 
the f ozm of a symbol tabl& 

Executable statemeitts describe the actions 
to be performed on the source data during 
*40'^*jpttygram execution in order to obtain the 
solution of the sohition of the user's prrfdem. 
ExecutaWe statements are ccmisf onned m the 
generalized compiler from higher level, func- 
tion-oriented language fonnat, II^l, mto 
45 r ^chm e level operatioi>-osaented fomia^ 

Nonexect^le Information: Before the 
start of proWem transformadwi, the infonna- 
tion from non-executable statements has been 

50 translated into symbol table entries by the 
SLT 10 of Fig. 1. In addition, every identi- 
fier appearing in any executable aatement 
has been listed in the symbol table. Thust, the 
. ,S]^bo!,^jable ..13 . ccsitains.- the names of . 

55' identifiers in the source program and the 
attributes which have been specified. 

Attributes are grwiped into fields in the 
symboJ table. Any attribute within a group can 
be specified by placing its assigned code in 

60 the field. If no code is placed into the field, 
a default code will be used. The defaidt codes 
assigned by the compiler are loaded into die 
symbol table prior to the start of lai^;uage 
translation. The programmer may declare his 



own choice of default attributes in his pr(>-~ 65 
gram. 

In addition to data attributes) ncmexecut- 
able informatiaDt in the source prc^ram in- 
cludes names and sizes of arrays^ relative 
storage locations of identifiers (COMMON 70 
and EQUIVALENCE)^ formats of data in 
external files, editing specifications;, informa- 
tion abcHit the hierardiial structure of data, 
external names, Statement Function names, 
entry names, function and library routines^, 75 
and communications with the operating sys- 
tem and compQer. All of this infoitnatioa is 
placed into the symbol table 13 by the 
(SLT) Source Language TraiKlator 10. * 

During problem traiisfoxmaticHi, additional 80 
names ^ facts are added to the symbtd 
table. The names assigned to. the storage of 
intermediate results are added In addition, 
records may be kept of the most recent time 
at whidi each intermediate lesuk has been 85 
computed in the program without a zckss^- 
quent change in any of the variaMes used in 
the computation. 

The symbol table is used by the (MCT) 
Machine Code Translator 14 module in ccHi- 90 
junction with code skdetons to generate 
machiTift instriKtions. The marhinf; language 
instruction sequence selected depends upon 
not only the operator involved but also the 
attributes of tl^ of^rands. During this time 95 
the (q)erands in the user's program are all 
symbol table addresses which makes it simple 
to loak up the attrit»it]^ 

The MCT 14 assigns storage locations to 
each operand in the symbol table: These JOO 
addresses may be absohite or rdativ& Rela- 
tive addressing is used, especially In multi- 
prpgramming systemsi, to permit die operat- 
ing system to assign the actual^^orage to be 
used. 105 

Executable Statements 

At the start of problem transformation, 
executabte statements ait in IL — 1 format 
and ooQsist of a keyword followed by argu- 
ments each of which is followed by an EOE 110 
code (end of expression). Arguments may be 
c l e mf nts or expressions. Expressions consist 
of a sequence of operand-(^)erator pairs. 
Operands are symbol table addresses of either 
elements or expressions. 115 

Expressions may contain expressicHis which 
ccxitam expressions to any level of nesting 
vrithitt the limits of the- Exprtsaon'ProcQisor' ' ' 
push-down stack. 'Essdki expression is termin- 
ated with an EOE (end of expression) code 120 
which causes the stack to pop up one level. 
The EOE code of the argument is at the 
first level and it causes a return of control 
frran the Esqpiesaon Processor to the routine 
specified by the keyword of the statment 125 
being processed. The Expressicm Processor 
also returns the symbol table address of the 
name assigned to the expression. The name 
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is nhe address at wbidb the value of the ex- 
pression is stored. The names of expressions 
are stored in a special area of the symbol 
table so that the compiler can recognize ex- 

5 pressions within expressions by lodking at the 
values of the operands all of whidi are sym- 
bol table addresses. 

During problem transformati(»a, each exe- 
cutable statement is transfomed into a 

10 sequence of simple insttuctions consisting of 
eiAer an infix operator and two operands or a 
prefix operatOT and one opemid. 

The keyword specifies the routine to be 
used for tihe transformation- The position of 

15 the argument in the sequence of arguments 
establi^es how it will be transfonned or 
used by the oHnpiler. Generaliy, the argu- 
ments will be processed by the Expr^aon 
Processor which generates die sequence of ' ' 

20 "i«^<"p^ instructLQOS necessary to compute 
the value of the argument. The address of this 
value is combined with the code sikdetoa for 
tbe statement being processed. The code skele- 
ton for ^ executable statement is a sequeinx 

25 of madune level instrucdcHis (triads hke the 
output of the expression processor) with sped' 
fic locations within this sequence assigned to 
the value of each argument This problem 
tnuxsformaticm module contains a code ^ele- 

30 ton for each executable statement. These 
code Nektons are machine-oEdepeadent and 
are not to be confused with the code skektons 
used fay the code gaierator to generate mach- 
ine code for a specific maichine. Some ^ecut- 

35 able statemmt compiler routines arrange for 
the assignment of index registers and analyze 
a larger part of the program for posable opti- 
mization. The Expression Processes optimizes 
at the low level of lodking for di:^licate triads 

40 v^di can be eliminated. The symbol table 
is used to store information needed for opti-r?^ 
mization such as the last location in a pco- 
gram at which a variable changed value. 



-cQMicepaiallyythe II>— 2 code resulting.frcHn^v .^t/.,- 
problem transformation is machine indepen- 45 
dent The language transformation to a 
specific machine is accranplished by the second 
Language Translator, as MCT 14 of Fig. 1. 
The purpose of this concept is to limit the 
impact caused by a change in the object com- 50 
puter (the computer on vMch the program 
being compiled will be executed). This coti- 
cept requires that the problem transformation 
output to tibe code generator be defined and 
designed to be set of machine operations 55 
^fhkh any of a dass of machines can perform 
by some sequence of its own sj^fic rnachine 
instructions or micro-program instructions. 

Implementation of IL'Nanexecutai?le 

Information 60 

The imi^ementation of this Generalized 
Compiler may ixtilSze current ccanpiler sub- 
programs and techniques to the extent possible 
without compromising this separtaion of lan- 
guage trandation from problem transforma- 65 
tion. The IL format for nonexecutable infOT- 
maticHi is a symbol table, as 13 of Rg. 1. The 
symbol table can be a conyendwial symbol 
table. However, it requires more fields than 
the symbol table for any one language since 70 
luyt all languages have ^e same set of non- 
executable information. For example FOR- 
TRAN does not use level numbers but Cobol 
and PL/1 do. 

Table 2, shown below, indicates a li^ of 75 
the infocmadon to be incorporated into the 
symbol table. 

This list can be reviewed and refined dur- 
in^g writing of the compiler. It gives some per- 
spective into the size and limits of the ta^. 80 
It is estimated that an average program would 
;have five to six htmdred statements and two 
thousand identifiers. 



Group 



TABLE 2 
Symbol Table Fields 
1. Expression 

Field Contents 



General 



Identifier Name 

Starting Address of Expression 

component String 
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Group 



Type 



Expression (Continued) 

Held Contents 



Statement Function 
External Function 
Atgument 
Triad 



Compiler Notes 



Last location in program at which 
value of expression was computed 



Group 



General 



2. Variable 

Held Contents 



Identifier Name 
Level No. 
Sign 

Complement (-Unary) 
Protect Read/Write 
Link to Additional Information 
Scope of Definition in Program 

Scalar 

Elementary 

Label 

Group Name 
Argtunent 

Temporary (Compiler Use) 
Pointer (Left or Right) 

■•'•Array*"'-* '-t.-^'^-^?' - ' ■■ ' ■*■ 
Structure 
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2. Variable (Continued) 
Gioiq> Field Contents 

Class (Cont.) Fbrward linked list 

Double linked list 
Tree 
Branch 
String 

1^ Syntax Type Label 

Dedmal/Binaiy 
Fised/Floating-Point 
Real/Complex 
Aridunetic 
Logical 
Relational 
Condidon 
Status 

Subscript Name 
Justified Left/Right 

Storage Mocadon Info. Length (PrecisiQn) 

Length — Fractional Part 
Size in Each Dimension 
Structural Size 
Synchronized 

Condguous to Preceding Item? 
Qfi&et from Start of Structore 

■ ^■-;-.?..^,-.K.'..:>..;^.vr^- - - ' Padoed ^. . - 

Ellcr 
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Z Variable (Contmued) 
Group Keld Qmtents 



Storage AUocadpn Info. (Cont.) 



Compiler Notes 



Common 
Equivalence 
Initial Values 

Starting Address-Assigned Storage 
Storage Space Required 

Defined at Location 

Used at Location 

link to Next Same Type or Level 

link to Last Same Type or Level 

Value Qbanged at Location 



3. linkage 

Group Field Contents 

General Identifier Name 

Relative Address in Program 
link to Additional Arguments 
' list of Dummy Arguments 

Return 



Type aitry 

External 
Function 
Sub-Routine 
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4. Constants, Literals, Character/Bit Strings 
Groqp Field Contents 

General Identifier Vahze 

Assigned Address 
Length 

Length-Fractional Part 

Type Constant (Numeric Literal) 

Character String 
Bit String 

5. Program Structure Label 
Group Field Contents 

General Identifier Name or Number 

Program Location (Etelative) 
Start & End 

Statement Sequence No. 

Defined 

Used 

Nesting Level 

Type Number/Name 

Statement 
Faragrapb. 
Section 
Division 
Do-End Group 
B^in-End Giroup 
Procedure 
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6. I/O Format 
Group Field ContEnts 

General Identifier Name or Nomber 



Assigned Address 
External Foonat (Pictare) 
Record Size 
Block Size 
Unit Assigned 

Type Format 

Picture 
Heading 
I/O Qmtrol 
File Control 
Buffer Qmtrol 

Umt Tape Unit 

Card Reader 
Punch 
Printer 
Console 
CRT 
Disk 

7. General Registers 
Groi^ Field Contents 

General Symbolic Name 

Roister Assigned 
Starting Address 
Freed at Address 
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Gtoup 



7. General Registers (Continued) 
Field Contents 



Type 



Posh-Down Stack 

Pointer 

Index 
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In the symbd table, die identifiers are 
separated into ffotsps accoirding to the types 
of infoiimatioa fields needed for die identifier. 
The groups arc: 



1. 

2. 
3. 



5. 
6. 
7. 



Expression naine (Statement Function, 
Triad, Argument), 
Variable name 
Linkage — 

Entry, External, Funcdon, Sub-routine 
Constant^ Literal, Character Stringy Bit 
String 

Program Structure Label 
I/O Format 
Register UtilizadQn 



IL Impiementatim — ExeaUabJe Statements 
Basic Structure: The Source Language 
Translator 10 tranMates all executable scate- 
ments of the program being ccmipOed, into 
IL— 1, ^diich is die hig^ kvd language 20 
fomaat of the input to the FroWem Trans- 
foamatran Module. The oiitput of die Prob- 
lem TransfonnadoQ Module is in a macbine- 
levd intmnediate language called IL — 2, 

In the £L— 1 fonnat, executal^e statements 25 
conast of keywords and aigument& Argu- 
ments may be either simple items or expres- 
sions. Espressions are sequences of operand- 
operator pairs. Operands n:iay be either single 
items or expressions. Fig. 5 shows the struc- 30 
ture of executable statements in the First 
Intermediate Language F(»mat, IL — 1. 



The general format is: 



Where: 



Argument 
Element 



Operand 
Operator 
EOE 



Keyword -j^argument EOE 



■ element expression 

: symbol table address of a single item 



•s^operand operator ^ 



EOE 



= symbol table address of an element 
or an expression 

= code for die action to be performed 
on the related operands 

= code for "end of expression" 



KeytDords: Every executable statement in 
35 H^l has as its first word a keyword. The 
keyword specifies die tdcation of the compiler 
routine used to process the arguments that 
follow. There is one keyword in IL — 1 for 
each Source Program function for which com,- 
40 piling capability is required. Only (me routine 
in die G. C is required for any one fimctitm 
even thoui^ eadi source langiiage may state 
the function in a differoit manner. 



For example, the Cbbol statement, "add A 
to B giving and die Fortran statauent, 45 
"C=A+*B" borii speidfy the same ftmction, 
namdy the assignment function. Therefore, 
regardless of the way the function is origin- 
aUy stated) the same routine can process diis 
ta^ once it is translated into the standard 50 
G. C input format, IL — 1. Table 3 indicarps 
the list of keywords in II^l, die First 
Intermediate Language. 
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TABLE 3 



IL Keywords (Highfir Level XL) 



EL Keyword 



Additional source language keywords 
which map into the EL Keyword and its 
arguments. 



Storage Control 

ALLOCATE 
ERBB 

Operator XnstmctioQs 

DISPLAY 

Conditions 



ON 

SIGNAL 
REVERT 



REPLY, EVENT, PAUSE, ACCEPT 



SNAP, SYSTEM 



Program Control 

DELAY 

STOP 

EXIT 



WATT 
END, RUN 



1/0 Commands 



OPEN 



OPEN (Cont) 



CLOSE 



FILE, DIRECT, SEQUENTIAL, 
BUFFERED, UNBUFFERED, STREAM, 
RECORD, INPUT, OUTPUT, KEYED, 
EXCLUSIVE, BACKWARDS, TITLE, 
PRINT, LINESIZE, PAGESIZE. " 
LOCK, REWIND 
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EL Keywords (Higher Level IL) 

AddtdoBal source language keywords 
which map mto the EL K^word and its 
IL Keyword arguments. 



1/0 Commands (Gont.) 



UNLOCK 
LOCATE 

GET 



UPDATE 



PUT 



GENERATE 



FILE, SET, KEYFROM, FIND, REWIND 
BACKSPACE. 

READ, FILE, INTO, SET, IGNORE 
KEY, KEYTO, COPY, SKIP, LIST, 
EVENT, NOLOCK, NAMELIST, 
ADVANCENG, REWIND, RECORD, ENVALID. 
FILE, KEY, EVENT, DELETE, REWRITE 
FROM. 

WRITE, FILE, PAGE, SKIP, LINE, 
DATA, EDIT, LIST, FROM, KEYFROM, 
EVENT, ADVANCING, PUNCH, PRINT. 
INITIATE, TERMINATE, REPORT. 



Problem Execution 



ASSIGNMENT 



DO 

GO TO - 
ALTER 



(dement array, structure,) COMPUTE, 
ADD, SUBT., MULT., DIV., BY NAME, 
ROUNDED, ON SIZE ERROR, GIVING, INTO 
FROM, CORRESPONDING. 
WHILE, TO, BY, PERFORM, UNTIL, TIMES, 
THRU, VARYING, FROM, AFTER, FOR, 
w^DEPENDDJGON 

TO PROCEED TO, ASSIGN. 
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TABLES 
IL Keywords (Higher Level IL) 



IL Keyword 



Additional sonrce language keywords 
which map into the IL Keyword and its 
arguments. 



Problem Execution (Cont.) 



10 



15 



20 



25 



IF 



MOVE 
EXAMINE 

TRANSFORM 

CXJNCATENATC 

SEPARATE 

CONTINUE 

CALL 

RETURN 



THEN, ELSE, IS NOT, NUMERIC, 
ALPHABETIC, POSITIVE, ZERO, 
NEGATIVE, NEXT SENTENCE 
GET, PUT, CORRESPONDING. 
REPLACING, TALLYING, ALL, 
LEADING, UNTIL, FIRST. 
CHARACTERS, FROM, TO. 



TASK, EVENT, PRIORTTY, ENTER, LINKAGE 



Operators', To meet the specificatioji that 
IL— 1 be source language independent^ the 
IL must describe all of the functions which 
can be described by any source language oi 
practical interest Such a task can be hope- 
lessly diflScul^ lesnfdng in an excessively 
large compiler if every source language fea- 
ture of every language is treated as a separate 
function. It is impoitant to analyze the source 
lai]^;uage features to determine what functions 
are actually invcdved. 

The most elementary iunctions involve ele- 
mentary operations to be performed on oper- 
ands. These actions are specified by operators. 
For the IL to be language independent^ die 
G. G set of operators must encompass the 
operator sets of die various source lai^uages of 
interest From a practical, standpoint, three . 
source l a ngu ages,' Coboi, Forti^ IV, and 
PL/1 have brai considered in the devdop- 
meat of the G. C Refinements;, such as 
the addition of a special operator from sane 
other source language, can be eaaly evalu- 
ated 

In addition to die coixventional operators 
from tiie above source language which specify 
actions ott data, delimiters, needed in IL, are 



included. They are considered to be operators 
(cmtrol operates) since they also specify 30 
action to he taken. 

The list of operators in the G. C also in- 
dices three new c^)eratare created for inq)Ie" 
mentation <rf IL. They are the prefix opeara- 
tors: .A., .L, and .O. . AH of the^ operators 35 
relate to the addresses of the data involved. 
The first operator, JL, specifies the operatiaa 
of determining the address <rf the associated 
operand vahie as the address of die desired 
data. The third operator, .O,, specifies that 40 
the^ operand address is an offset address to 
which a base address is to be added. 

With these new operators, it is felt that the 
nmnber of soinre language functions which 
can be described deariy and simply without 45 
. special, routines is considerably increased. Fot 
example, PL/1 has the "Pointer'' and "Off- 
set" features which are classified as special 
data types. The IL address opeaators can 
dearly specify and process each of these 50 
througji die ExjHcssion ProcesKir by treating 
die address as a function of r^ular data. 
Likewise, list procesang source laggiiagps 
create luied lists ^ch can eaaly be trans- 
lated into IL using die address operators. 55 
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Table 4 sluvws a list of the opeiatars in IL (left to rigjit) basis. As an excepti(ki to tiuV 

and the precedence lelation^ps. For opera- rale, Procedeoce Level grmq> 2, the prefix 

tors in the same precedence group, tiie order operators;, are processed in a ligjit to left 

of occurrence in an expres^on estd>lisiies the order, 
pcecedotce in a Jtet coane, first executed 

TABLE4 
IL Operators and Precedence 

Precedence 

Levd IL Operators DescdtptiQn 



0 


EOE ) , .O. 


CONTOOL& OFFSET 


1 


** 


EXPONENTIAL 


2 


-hPrefix -Prefix jiot 1 A. 


J. PREFIX 


3 


* / 


ARITHMETIC 


4 




ARITHMETIC 


5 


II 


CONCATENATION 


6 


>^ = ^ < 


RELATIONAL 


7 


& 


BIT STRING "AND" 


8 


1 


BIT STRING "OR** 


9 


.and. 


LOGICAL "AND" 


10 


.or. 


LOGICAL "OR** 


11 


( FN (Function Code) SB (Subscti 


ptCode) CONTROL 



|EOE( ) .0. , FN SB)j-ARE ALL CONSIDERED TO BE CONTROL OPERATORS. 
TftEY ARE NOT A PART OF THE OUTPUT CODE. 

Operands: Operands are the symbol taWe language translator adds a special code which 

addresses of die identifiers in the user's pro- means "no q»erand" whenever necessary (pre- 

gram. Identifeis wtoJi axe die names of ex- ceding the fim operator ot m between two 35 

presslons are kept in a separate area of the consecutive operators). 

15 symbol table so that the Expresdon Processor In case the i»ecedence of operations in a 

caa recogmze expressi(His within expressaons source language is different from die prece- 

with a minimnm of effort dence conventic»)s ad^yted for the IL, the 

Expressions': Expres^ons in ILp— 1 fonnat. Source Language Translator for that language 40 

the input to the Problon Transfonnation must msert parentheses as necessary into the 

20 Modul^ are simply, nearly one-to-one trans- soiHce p rogr am cxpresaons during translation 

formations of source language expressions, into IL unless provision is made in the G. C 

Operand names are rcpiaoed by the symbol to accqrt a user definition of the precedence 

tal^ addresses of theiiaames'Opafatw^^^^ ' "-^ — . -....-r .... r 45 ^./.Mi:,.^^!/*^:^^^^ 

of tiie source language are replaced by the Arguments: The arguments in ID— 1 re- 

25 opanator code equivalent in IL. In order to present the hrfonnation content of the execut- 
avoid duplication of the language translation able statements of the user's program. Inas- 
task of symbol recognition in die Problem much as IL is not exposed to the program- 
Transformation module 12, the expression mer except during the writing of tiie com- 50 
structure is deagred so tiiat every odd num- piler, most of the redundancy amunon in 

30 bered wwd in any expressicHi is an operand source languages has been removed. An argu- 

and every even numbered word is an opera- ment may be a single value as a DO loop 

tor. To accompli^ diis format^ the source parameter or it may be an expression such as 
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Ae expression to tie ri|ht of the equals sign 
in an assignment statement or an expressitm 
which represents the value of a parameter. 
EOE: EOE is the symboHc name used to 

5 represent the internal code used in XL— 1 to 
delimit (delimit=to separate) egressions and 
arguments. The code i^ed is one of the codes 
not assigned for character representation in 
either the ASCII or EBCDIC code. This code 

10 is used to pop-v^ the pu^-<iown stack of the 
Expression Processor to the next level at the 
end of processuig in exprcssicm and to ad- 
vance the statmcnt processing routine to 
the next argument at tibe end of compiling 

15 an argument. 

Statements: Executable statements in a 
source languagg specify mappings or trans- 
f onnation to be performed on data during 
execution of the user*s program. These trans- 

20 formations are functioim whidi are indepen- 
dent of die source language is q)edfied. The 
Source Language Tr^islator 10 transforms 
the ftmctional values and expressions oi the 
source statement into the leq^ective argu- 

25 ments of an IL — 1 statement. The arguments 
in an EL — 1 statement are the values or ex- 
pressions to be processed by the Problem 
Transformation module^ 12. 
For example, the source language statement 

30 "GO TO 101" specifies the unconditional 
branch functi(m in Fortran. In some other 
source language, the keyword might be 
"BRANCH TO" P. (or any other symbol) 
without affecting the function to be per- 
formed. Likewise die identifier mig^t be an 

35 alphanumeric name or an expression without 
changing the function. 



' Sosne cbmphcat^ functions^ whkh can fx. 
expressed in cme sentence, are equally speci- 
fied by any one of several dif- 
ferent sets of argcnnents. Fw ex- 40 
ample, the test for conTimu'ng the 
iteration in a DO loop may be q^^edfied as 
the upper limit of the loop variable or it may 
be specified as a test of a logical expresa<Hi 
in v^ch case iteration continues for as hmg ^ 
as the value of the logical expresston is **true". 
It seems mane practical to handle this type 
of situation in IL — 1 by providing argument 
positions in the EL — 1 statement for each 
argument of each alternative radier dian im- 50 
posing a more cooc^^ted langi^ge trandar- 
tion from source language. The Problem 
Transformation Modde 12 recognizes eadi 
argument position by the EOE code ^^ch is 
present even for aigoment positicms vdiidi 55 
have no argument ErxOT detecting logic is 
used to insure the presence of a suitable set 
of arguments. Tables 5(a) and 5(b) are ex- 
amples of FORTRAN and COBOL source 
language iteration statements expressed in a 60 
newly developed Metalanguage \diich is de- 
scribed sub^quendy hemnafter. Table 6 
shows the PLr— 1 source language iteration 
statement in the Metalai^uage notation, while 
Table 7 shows, in Metalanguage notaticm, the 65 
single, gencrdiizedj First Intermediate Lan- 
guage (IL — 1) iteratiwi statement into windi 
each of the Fortran, Cobol, and PL/1 state- 
ments of Table 5 and 6 map. 

Tables 8(a) and 8(b) ilhistrate how die 70 
iteration! statements from FORTRAN and 
COBOL map into this single IL — >1 statement, 
in Metalanguage notation. 
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FORTRAN Iteration Statements Written In New l^ietalanguage 
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agtmti DO astmts ilO 



fkipn rJip3-| 
lipl y \ip2 J L^p3 



J 



astiDts I)0 astmu ill 



_ rkip4"l rkq>5"> |-Jkip6-j 
lip4 J tips j Iap6 J 



aBtmts 



rkip7-| fkipSI r-Mv9-i 
DO aatmt, U2 = ^ -{ Y \ 



astmttt — 



(executable program statements) 



aatmU — 

'S ,:.t ■■ ■■ -y 



astmta — 
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COBOL Iteration Statements Writtm In New M^alangoage 
PERFORM aproci ^THRUaprocaJ [{^10^} TIMES J 

PERFORM aproci |^THRU aprooaj UNTIL Elj 

PERFORM aproci jrHRUaprocaJ VARYING vnl FROM 

{knll |ial2i -r 0^3 1 

^ BY ^ V UNTILEli AFTER vn2 FROM ^ V 
vn4 J [ynS J -L \m6 J 

Q UNTIL ElgT AFTER vn3 FROM -j I 
-I l^vnS J 

I UNTIL EIsJ 



TABLE 6 

PL — 1 Iteiadon Statements Written Li New M^alangoa^ 
al: DO vnl = mnl TO nm2; 



a2: END [aQ ; 

al: DO ; 

a2: END [al] ; 

al: DO WHILE NOT Ell ; 

a2: END [al] ; 



BY 



BY 



al: DO vnl = nml [TO nm2 [BY nm3]] 
[WHILE NOT El J ; 

a2: END [al] ; 
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TABLE? 

XL — 1 Iteration Statement Written In New Metalanguage 
DO cl al a2 vnl nml nin2 nm3 £1^ 
[a3 a4 vn2 nm4 nm5 nm6 £1^ 
[a5 a6 vn3 mn7 mnS mn9 HJ] 
WHERE: 



cl if zero, test before iterating. If one, iterate 
before testing. 



ral,a2,a3, 1 
|^a4^a5, aS, J 



are statement sequence numbers assigned by the 
source lai^uage translator or symbol table 
addr^ses statement^ pai^graph or section 
labels. If al, a3, a5 specify paragraphs or sec- 
tions, a2, a4, and a6 are blanL 



vnl, vn2;, vn3 specify DO loop variables. 

nml, mn4, mn7 initial values of DO loop variables. 

mn2, mn5, mn8 the limiting values of tiie DO loop variables. 

mn3, nm6, mn9 die value by which the DO loop variables are 
increnientnl (or decronented, if n^ative) after 
each iteration 

Ell, ^^2> logical egressions which are tested before each 

iteration. Iteration continues until concession 
is true. 



MAPS 

SOURCE 

FORTRAN 

a stmt -1 

kipl tiuru kip9 

ipl thru ip9 

ilO 

ill 

il2 



TABLE 8 (a) 
Iteration 



n^i 

a — i 

mnl thru mn9 
mnl thru mn9 
vnl 

vn2 ' 
vn3 
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TABLE8(b) 
Iteration 

MAPS 

SOURCE n^i 



COBOL 



Qproci 


al 


^procs 


a2 


kiplO 


/mnl = 1 
\niii2 == kiplO 


iplO 


/mnl = 1 
\mn2 = iplO 


vn4 


mnl 


vnS 


mn3 


vn6 


nm4 


vn7 


mn6 


vn8 


nm7 


vn9 


mn9 



/i^— 2 PoTmat 

The output of the Problem Transf oraiation 
Module 12 consists of executable isstructioDs 

5 in IL^2. Unlike XL — 1, which is in hig^ 
level language^ IL — 2 is at the machine opera- 
ixon lever \ciiich means that the tran^tioa to 
a specific ' machine code from IL — 2 is ap^ 
pixxrimately cme-to-one. The Pioblem Trans^ 

10 fonnaticm Module 12 transfomis each state- 



mait from IL — 1 format into a sequence of 
instructions each of which consists of a macro 
call or an operation code and the one or two 
operands involved in this operaticm. Table 8(c) 
is an example of the IL — 2 skeleton fnwn 15 
whkfa die ILr— 2 ii^tnicdon sequence wiE be 
generated by die Generalized Compiler 'as per 
the sequence of tasks listed in Table 8(d) as 
the map[»ng oi die IL — 1 statment of Table 
7, 20 
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TABLE 8(c) 

JLf—2 Instnicdon Sequenoe Table far IL — 1 
Iteration Statement (Code Skeleton) 

STMT STMT 

NO ADDRESS INSTRUCTION 

1 vnl = mnl 

2 vn2 = mn4 

3 vn3 =mn7 

4 GO TO a7 

5 al CONTINUE 

6 IF vnl >nm2 GO TO a2 

7 IF vnl <mn2 GO TO a2 

8 IFvnl>nm2ANDnm3> O OR 

vnl<nm2 AND mn3<0 GO TO a2 

9 a3 CONTINUE 

10 IF vn2>mn5 GO TO a4 

11 IF vn2<nm5 GO TO a4 

12 IF Yn2>nm5 AND mn6>0 OR 

vn2<nm5 AND mn6>0 GO TO a4 

13 a5 CONTINUE 

14 IFvn3>mn8GOTOa6 

15 IF vn3>mn8 GO TO a6 

W IF vn3>mn8 AND nm9>0 OR 

vn3<mn8 AND mn9<0 GO TO a6 

17 IFEliGOTOa2 

18 IFEljGOTOa4 

19 IFEl,G0T0a6 

20 a7 CONTINUE 

21 <:mtemehts to be iterated from a5 to a6> 
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TABLE8(c)(Coiit) 

STMT STMT 

NO ADDRESS INSTRUCTION 

23 GOTOaS 

24 a6 CONTINUE 

25 <statemeiits to be iterated fixxm a3 to a4> 

26 vii2 = vn2 + nm6 

27 GOTOd3 

28 a4 CONTINUE 

29 <stataneDts to be iterated fixun al to a2> 



30 vnl — vnl + mn3 

31 GO TO al 

32 a2 



Table 8(d) shows the tasks pexfonned by 
the Generalized Gompiler GC— 11, with re- 
spoct to mapping the EL — 1 iteration state- 
5 ment of Table 7. 

Table S(d} 
Generalized Compiler Tasks 
(Statement numbers refer to statements in the 
IL — 2 Instruction Sequence Table) 

10 Task 

1- <If al is blank, go to ta^ 30 (cttot 
exit)> 

2. <If a2 is blai*, al is the symbcd table 
address of the pnK:edtne to be iterated. 

15 Lode iq> the a2 field in the symbol table 
word for this procedure: Set up the code 
to call this procedure (mjc siiown in die 
IL — 2 instructioQ sequeiice)> 
3- <If a2 is given, lode Tq> in . the symbol 

20 table to detecmine if a2 is a proceduie. 
If it is, replace tbe procedure address 
specified as a2 wiiii the a2 fidd in the 
symbd table word for this procedure. 
Set up tbe code to call diese procedures 

25 (not shown in tbe IL — 2 instrucdon 
sequence)> 

4, <If mnl is blaiik and nm2 is specified, 
assign a value of 1 to nml> 



5. If a3 is blanl^ set a3 equal to al, a4 
equal to q2, mid delease statements 9, 25s 
27, 28> 

6. <If a5 is blank, set a5 equal to a3, a6 
equal to a4, and ddete statements 13, 

21, 23, 24> 

7. <if inn2 and mn3 are terth blank, ddete 
statements 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 
11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 

22, 26, 30, 31 and go to task 29. This 
is a single iteration re^gardle^ of whedier 
Ell is spedfied> 

8. '<If cl is zero ddete statements 4, 20> 

9. <If Ell is blank, defcte statements 17, 
18, and 19 and go to task 11> 

10. ;<If mnl is blank, ddete statements 18, 
19> 

11. <If mnl is blank, delete statements 1, 2, 
3, 6, 7, 8, 10, 11, 12, 14^ 15, 16, 2^ 
26, 30, and gp^ to task 29> 

12. <If mn3 . is blank, assign the value h . 
to it> 

13. <If mn3 is a poative literal, ddete 
statements 7 and 8 and go to task 16> 

14. <If mn3 is a native literal, ddete 
statements 6 and 8 and go to take 16> 

15. Kdelete statements 6 and 7> 

16. <If Ela is blank, ddete statements 18, 
19 and go to task 18> 

17. '<If nm4 is blank, ddete statement 19 > 



25 



1,367,741 



25 



10 



15 



20 



30 



35 



45 



50 



3, 10, 11, 12, 14, 15, 16, 22, 26, and go 
to task 29> 
19. :<If nm6 is blaok, as^gD the value, 1, to 
it> 

2a Klf iim6 is a poative literal, ddete statfr- 
meats 11 and 12 and gp to ta^ 23> 

21. Klf iim6 is a ne^tive literal, delete 
statements 10 and 12 and go to tEi^ 23> 

22. <delete st^emeats 11 and 12> 

23. <If Ela is bhxsky ddete statement 19> 

24. U mal is Idank, delete statements 3, 14, 
15, 16, 22 and go to task 29> 

25. .<If m^ is bl^ik, asagn a value of 1 
to it> 

26. <If mn9 is a positive literal, ddete state- 
ments 15 and 16 and go to task 29> 

27. <If nm9 is a negative literal, ddete 
statements 14 and 16 and go to ta^ 29> 

28. <ddete statements 14 arai 15> 

29. <geaerate IL — 2 output code sequence 
for the statements not ddeiBd> 

30. <error exit> 



General Chaiacteristics of the Intermediate 
25 Language 

Table 9 lists the characteristics of the IL. 

Table 9. 

Gharactenstics of the Intexmediate Language 



1. It is a source language independent 

2. The inpu^ EL — 1, is fimction-oricnted. 

3. Esecutable mpvA statements, in IL — 1, 
are in higher levdt language format 

4. There are no data dedaratioo statements. 
All data names and attributes aie m a 
symbol table. 

5. Alt data names not mdidtfy defined 
automatically assume default attributes. 

6. AH names (operands) are symbol table 
addresses. 

40 7, Operator codes and punctuation codes 
are suffident to describe alt generalty 
ised source lairiguage operations. 
Every executable input statement starts 
with a standard keywoni ft^ the function 
to be executed. 

9. It is indepeodoit of the object computer. 

10. The output IL — 2, is computer opeiation 
oriented. 

EL — 2 output code is at the machine 
operation level 



8, 



11 



Tables 5, 6, 7 and 8 provide an example 
~" of the transfoimatidns ' mvclvcad '* using" IL. 
Tables 5 and 6 sbonr various source lai^uage 

55 statements used in FORHIAN, CX)BQL and 
PL— 1 to specify the iteration fimctioa Table 
7 shows the IL — 1 format vrhidi can represent 
any of the various souzce statemmts shown 
in Tables 5 and 6. 

60 In other words, a common iterative function 
can be specified by a FORTRAN *DO'* loop 
or a COBOL *TERFORM" statement or a 



-^TK>-c-=:---END" group <tf PL/l,^Therefor^ — " - 
any of the above source language f oimats for 
iteration can t% easily translated by SLT 10 65 
into a sin^e format^ IL — 1, and this standar- 
dized statement can he prooessed by a Gener- 
alized CGanpiler GC — 11. This Generalized 
Compiler GC — 11 may be deagMd as a 
module such as the ALU of the host ccan- 70 
puter or it may he designed as a piece of 
peripheral equipment or a set of sul^outines. 
Such a de^gn may be economical^ feasible 
in tiaidwaie because of the wide application 
of the same compiler to many computers and 75 
many source languages. In this example for 
iteration, the intexmediate langua^^ input for- 
mat^ IL^l, at the input to the GO— 11 is 
shown in Table 7. 

Where: The tetters specify dummy van- 80 
ables to be rej^aced by scope of definiticni, 
indexing lat^els, index range lunits, increments 
and conditions. The output of the GC — 11 is 
LI— '2 and oozsists of mafhitre' level instruc- 
tioos (sixdi as: compare, increment, branch, 85 
and test) inserted in the proper sequence and 
locations in the text of the problem prc^ram. 
Although the output is at the machine code 
levels it is functicm-oriented and madiine in- 
dependent The translation of this ouqpnt into 90 
a specific marhinc code is, in most casesi^ a 
simple task of table look-up. Table 8 ^ows the 
IL^Z oidput instructions iixto which the H^l 
statement of Table 7 is tranrfonncd dur- 
ing proUem transformation. In addition, the 95 
compiler system may add ixstnicticHis to save 
and restore the contents of index registers as 
needed as other supporting operations. 

Advantages of the Computer System Using 

Generalized Compiler Concept 100 
The advantages of this compucer system are 
prinmily the result of organizing the work 
into independent decentralized functions and 
isolating these ftmctionsL As a remlt» die fol- 
lowing advantages are obtained. These are 105 
briefly outlined first and then discussed in 

Ad'oantages: 

1. Compiler design cost is less because the 
compiler is leefut over a broader range 

of appKcation leailting in eccmomies of 110 
volume. 

2. Compiler design can be optimized and 
refined by iteration of the results of imtial 
dedgn because the broader use and longer 
Hfetime produce suflScient returns from 115 

' e^ " " - 

3. Compiler is simpler to design and ddbv^ 
because it is orpnized into independent 
decentralized modules. 

4. Qmipile time can be substantially reduced 120 
by hni^emettting the design in hardware- 
firmware. 

5. Operating system cwnplexity can be re- 
duced because of the reduced demands on 

the system by the oanpiler, 125 
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7, Better language features will pennit more 
program optimizaticu! by the programmer. 

8. It is compatible with the development of 
5 common data base systems and other cur- 
rent projects which may radically alter 
compiler design specifications. 

Cost: The current compiler deagn ap- 
proach requires a separate unique oomiMler 

10 design for each source language/machine cooi- 
bination. Each compiler has relatively litdc 
utilization and a short lifetime. In ccmpari- 
son^ a large part of a generalized compdler can 
be used without change with numerous source 

15 languages and various makes and generatioiK 
of comimters. Therefore^ the cost of the de- 
sign is distributed over a larger number of 
units and is proportionately less. 
Compiler Improvements: Because of the 

20 comj^exity of current compilers and their 
limited applications, once a com|»ler desLga 
is debugged, the design effort gqaOTlly ceases. 
CSianges are strongly resisted because of the 
ri^ of creating unsuspected new problems. In 

25 addition, it is not justified to redeagn in 
(Hder to oveiC(Hne disadvantages discovered 
during operation because the potential im- 
provement will benefit only a rdatively small 
user group. In compari^Mi, extensive analysis 

30 af the Problem Transformationr Module of the 
Generalized Compiler during use is justified 
because it perfonns standard functicHis com:- 
mon to all onnpOers and the benefits of any 
improvement will be widespread. 

35 Simplicity: The concept of separating the 
mdep^uient parameters of langirage ttansla- 
tiom and problem transfcHmation into separate 
modules* effect, decentralizes die controi 
from (me compHcatcd control and flow se- 

40 quence into three independent sim|^^^^^ 
As a result; design del^ls do not impact' the 
v^bxAe compiler but are dearly limited. Theres 
fore^ designing and debugging are easier. 
Speed: Because the Problem Transforma- 

4^ tioQ Module is independent of both source 
language and objeKn: computer, it can be used 
with many source languages and many com- 
puters. It will not change with changes in 
either source language or computer machine 

50 codes. 

Therefore, it is fea^ble to d^ga this imit 
in hardware-firmware providing a pofeentsal 
speed improvement of several orders of mag- 
nitude. This portion ci the compiling task 
55r ^can .be. perfcmned.. at .nearly madhine .^cycle., 
speeds. 

Reduced Supervision: Operating System 
complexity and the resulting time spent by the 
computer for operating system duties (n<m- 

60 {voductive expense) are bsscoming major prob- 
lons. As languages beccmie extended, cc»n- 
pileis become larger and require more operat- 
ing system effort to schedule and to swatdi 
to and from active storage especially in a 

65 multi-prograDoming environment. 



loL the G^eralized Compiler^ the Lai^uiage 
translators (SLT 10 and MCF 14} are much 
smaller and dierefoie eader to sdiedule and 
switch. The scheduling of the ALU is greatly 
reduced especially if the Problem Transfor- 70 
mation Module is impkznented in hardware. 
In additicm, the switdiing and active storage 
needs are reduced. It may now be feasiHe 
to perfoim the entire compile task in peri- 
pheral equipment^ ftather simplifying the 75 
operating system. 

Source Language Impraoements: In die 
Generalized Qmipiler, the Somce Language 
Trandator is the only module affected by a 
change in the source language. The tranka- 80 
tion in this module is a simple ai^roximately 
one-to-one translation into intennediate lart- 
guage. Therefore, source language changes to 
better adapt the language to the user are 
rdatively easy to implement. Likewise;, die 85 
impact on the ccHupder is stricdy coi^Bned 
wtudi means diat the ri^ of adding unsus- 
pected errtMs is proporrionately reduced. 

Better O'^imizatian: Small language tnans- 
iatOTSs as in the Generalized Compiler, are 90 
easier to transfer into sad out of active stor- 
age, and consequendy source program switrh- 
ing from one language to another within a 
program may be practical This would allow 
the programmer more flexibility to optimize 95 
either the writing or execution of a program. 
Since the programmer knows more than the 
compiler deagner abtHit die context of the 
problem beii^ pn^rammed, the programmer 
is m the best poation to optimize the pro- 100 
gram globally. Local optimizatiori, im die 
other hand, is probably best dtme by the 
compiler. 

Reduced Obsoiescencei Because the Gener- 
alized Compiler is divided into independent 105 
modules^ the impact of a change is stricdy 
limited. Likewise^ it can be adapted to radi- 
cal changes. For example, the work now 
being done on data base management and a 
common data base for a ccdmrnsodty of users 110 
may lead to the separation of compiler tasks 
into two spedaHzed comiMlers, oa& of vdiich 
compiles data attributes and environment 
information, and the other whidi compiles the 
executable procedures. The Generalized 1 15 
Compiler is not only compatible and adapt- 
able to this concept, but may accelerate die 
development 

. Description: New Metalanguage - — 

From general considerations, it was fdt 120 
de^rable to provide: 

(a) Separation of data attribute dedarati<Mi: 
from executable statements into a separate 
cranpilex functi(Hi to provide a run-time ^m- 
hci table or a data base so that data attri- 125 
butes can be taken into consideration during 
program execution. In other words;, make 
provision in the compiling process to handle 
the cwnmon data base. 
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- - (b) ' - ^Develop the iGeneralized GHnpiler 
coocept of EL--^1 so that simple translators 
can be designed for each souice language, 
(c) Develop a software writer^s intenne- 

5 dme language^ IL — 2, so that opemtmg sys- 
tems and compilers can be written and de- 
bugged in a simpler fa^oa by ising an in- 
tennecHate language. 

In oxder that tibis migjit be done^ it was 

10 felt necessary to develop a new Metalanguage 
in order to be able to precisely define the 
dements various KHnce languages. Since a 
good precise definitioa of a "problem'' is the 
best basis a good "sohitionf", it IS necessary 

15 that there be provided a new Metalanguage 
which can be used as a tool to organize and 
simplify complex higher levd language in- 
formation, ffyisting metalanguages either £^ 
to q>edfy this infasmatioQ precisely or fail to 

20 be concise. If the' soiHce language statements 
are not ^>edfied, bodi precisely and coiKisely, 
the inf ortnatioa content and, therefore, the 
mappmg requirtment will not be properly 
comprehended. 

25 TTius, in order to devdop, from a source 
language, a fuse iotennediate language 
(it—l) smtable for operation with a Gener- 
alized Compiler — and for developing a 
second intermediate language (IL — 2) vrfiich 

30 is suitable for easy conversion into the object 
machine code for any specijBc type of caca- 
puter it vm f dt necessary to dc^op a pre- 
cise method of handling and dpfinfng the in- 
formation content of all the major souice Ian- 

35 giiages. The new metalanguage is a txxd which 
aids in recognizing identical information 
among various source languages, by systema- 
tically labeling' the information content of 
each source language statement, so that iden- 

40 tical inf onnation units from various source 
languagqs:C8n be mapped into the same IL — 1 
function. 

Taking into consideration the problem of 
man-madune commuoicatioa betfreen soim:e 

45 languages (such as COBOL and FORTRAN) 
and softw^ writers it is seen that tiierc is 
required some sort of conmnmication link 
between these two demfnrs. 
Metalanguage;, which is here devdoped as 

50 die notati«ft used to specify dements in 
source languages, is the communication link 
between the man (the software writer) and 
the machine (the soiuce language). Possibly 
a major cause of the chronic failure of soft- 

55 ware to be delivered on time and to perform 
to spedfieatiOTisi is the'rusB ^f softivate^i^iDfde*^ 
to go to work OD the final product while 
ignorii^ the importance of devdoping good 
tools f«r the job. SiKh good to^ would pri- 

60 marily be good notation (Metalanguage) and 
good documentation. Alany poor software 
devdopments are the diicct result of the 
failure to pn^)erly evaluate jobs due to defi- 
ciency of notation or adequate tools for deal- 

65 ing with source languages. 



^ ■ ' to notation, not' t^y^ failsriic/v- 

to clarify the requirements, but also creates 
misconceptions about the complexities and 
relative importance of various components of 
the task to be done. Thus, poor woddng pro- 70 
grams oftoi are the result of poor cosnmuni- 
cation and poor commimicatioQ is very often 
the resuh of poor notation (poor Metalan- 
guage). Thus> good notation is extremdy im- 
portant in providing insigjit and good judg- 75 
ment for the software devdoper. 

A Metalanguage^ to be a precise and useful 
tool^ should be designed so diat it can pre- 
dsdy describe a lan^iage at each hier^chical 
level in terms of the next lower hierarchical 80 
levd. This approach is required to provide 
insight into die context of the problem by 
dimiinating the saturation ''noise" from lower 
levds in die hierardiy. • ' ' 

It is of great importance to recognize the 85 
infonnattcm content of the statements in the 
source languages so that the intermediate lan- 
guage (IL) will preserve the infonnation ccxi- 
tent and so that identical information content 
from different source languages can be recog- 90 
nized as sudi and mapped into the same in- 
termediate language statement without re- 
dtQidancy* 

One of the Metalanguages currently in com- 
mon usage to describe computer languages is 95 
called Backus Normal Foim, BNF. Although 
it appears to be useful for simple expressions^ 
it is not practical to use BNF to describe 
complex iuglier levd languages. In BNF, a 
pair of < > is used to delkoit each generic 100 
term, and funhennore, a full word is gener- 
ally used for each such tenn. This constitutes 
notational dutter and excess verbiage. The 
structure of the word used gives sto indica- 
tion of the amount of infonnation involved. 105 
For example: - ;s>rc,v^ 

the notatiot^ < integer > is "larger" than 

<tenn> although tenn refers to a miKdb 

larger unit of sofcomatioo. 
Thus tiie hieraidiial levels of information no 
cannot be easily labded wx identified. 

A predse notation can be an effective tool 
to aid in analysing langua^ to detemune 
the actual infomnaticHi specified. To be useful, 
the Metalanguage must be capaWe of spcc\r 115 
fyiqg the Imiguagp-bdng-analyzed conosdy 
and i»ecisdy^ and it must be easy to learn 
and use. The Metalanguage structure itsdf 
should assist by conveying a part of the in- 
formation; Thus the following requirements 120 
"are^stated "as the spedfications foF a'^MdaP 
language; 

1. The meaning must be unambiguous. 

2. It must be capable of describing higher 
levd languages preasdy and cwidsdy. 125 

3. It does not bury the major information 
units in a mass of detail which constitutes 
"noise** to the user. 

4. It is capable of specifying information 

at each hierardhical levd in terms of the next 130 
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lower level of inforaiaticm units. 

5. It is useful as a tool to psrovide in^g^t 
into the true nature of the problem involved. 

Nemly Deodaped Metc^anguage (ML) 

5 The Metalanguage newly dercloped is a 
set of symbols and rules designed to pennit 
the descriptioQ of statements and s^>anfica- 
tions of the syntax aiui the semantics rules 
of various higher levd languages such as 

10 COBOL, FORTRAN, etc. in a jwedse and 
a concise manner. In addition^ it is designed 
to minimize notational dutter and confusion 
so that it can be used as a tod in analyang 
the infoimati(Hi content of various languages^ 

15 The notational structure itself conveys infor- 
mation about the: (a) magnitude and (b) 
importance^ ^of .die infoimatioa being lepre- 
seated. The notation is designed to be easy 
to learn and easy to remember. Figures 8 and 

20 9 show language concepts organized into a 
hierarchy of magnitude aiid impartaince vnth 
the use of specid rotaticn to dMeientiate the 
hdracchy and meaning of concepts. 

Thus, a single lower case letter (Fig. 9) 

25 is used to rqpresQot a primitive information 
unit <m: ekment (any one of a set of similar 
elements). The sm^e lower case letter with 
a lower case subscript or sufifiz reipresents a 
subset of the set represented by the letter. 

30 Two major subsets ai all informatioit demits 
are "ideaatifiers** and "operators". 

It should be stressed that this described 
Metalanguage notation is exemplary only 
with fespect to the symbols and scope of 

35 defniticms. Odier syinbols and defimtion,- 
scape could be used to desaibe the same pre- 
dse method of organiziitg a hierarchy of con- 
cepts. 

Coinbiimtions strings of elemaits are 
40 specified by a sing^ capital letter. Thus, as 
seen in Figures 6 and 8, the letter "E" is 
used to specify one manber (any member) 
of the set of all expfessiaos; Sid^sets are sped- 
fied by »ibscripting lower case letters or mon- 
45 bers to the sin^ capital letter as seen by Mi 
or El. 

Subsets are also specified by suffixing lower 
case letters to the sm^e capital letter. ITiuSi 
"Eb" (Fig. 8) is any member of the subset 
50 that includes all logkal numeric expressions. 
In addition to die information conveyed 
by the structure (whether the lettser is upper 
case or lower case) the letters assigned to 
f - . various sets .ate j'Sdected for ease* of • learning 
55 and assodation with the set referred to. Thus, 
"E** is for expressions; "e" is for primitive 
dements; "o" is for oi^iators, etc. 

All of the above notation is for specifying 
"generic" sets of information. Jn order to 



specify a "particular" member of any set, the 6t) 
following rules apply: When a q>ecific mem- 
ber is specified^ it is considaied to be a 
literal of die Metalanguage and it is written 
entirely in capital letters^ if it is an identifier 
of 2 or more letters such as a reserved word. 65 
IMgtts and so^Hce language special symbols 
axe written without change oe extra notation. 
Any source language letters appearing in a 
htetal are written as capital l^teisi 

Since the Metalanguage symbols axe not 70 
the same as those u^ by the source lan- 
guages, there is no difficulty in distinguishing 
die source language synibols from Metalan^ 
guage symbols. Tlie only exception is the 
three dot (. . .) repetition symbol vdiich is a 75 
Metalanguage symbol also iKed as a JOVIAL 
language symb^ Because diis symbol is so . 
convenient and because JOVIAL is not a 
commonly used language it is fdt that T^^en 
JOVIAL repetition notation is needed it can 80 
be specified by a sub-satpted h^ . . .). 

Since the major source langpages of interest 
do not use lower case lettaran^ the Maalan^ 
guage lower case ktters said amibinatians of 
one capital letter combmed with one or more 85 
lower case letters, axe eaaly identified as 
Metalanguage. Scrhoc language words of odc 
letter or character which can be confused 
with the upper case Metalanguage notation 
(or the repetition symbol in JOVIAL) are 90 
specified by the lower case letter h with a 
subscript oonsisting of the character besng 
specified (either a ca^Htal letter or digit (mt 
repetition symbol <rf or other symbol). Tims, 
the source language letter "A" (Metalangqage 95 
literal) is thus ^)ecified as ''hj^*'. 

One feature of BNF vMdi has been inn 
coiporated into the new M^alanguage is the 
use of T^gHsih language vdiich is endosed-in 
comer brackets < > to describe language 100 
requirementsL This permits a deferment of 
the design of actual code for a language until 
after all the specifications have been deter- 
mined. 

The following tables 10 tlMOu^ 13 diow 105 
a preliminary partial assjgmnpnr of the let- 
ters of the al{^abet to sets and subsets and 
die Metalanguage symbols. 

Table 10 herdnbdow ^ows die notation 
used in die Metalanguage with defimtioos 110 
thereof. 

Table 11. provides definition of lower case 
letters of the al|dbabet and examples of use 
in the- Metalanguage; • - ^ — ^ "^-^Mriii^- 

Table 12 (a) defines informaticm element 115 
sets> while table 12 (b) ^lows the subset of 
various identifiers. 

Table 13 defines the subsets for various 
operators. 
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TABLE 10: Metalanguage Notation 
square brackets — used to enclose an optional part of the format. 



Braces — designate a choice among alternatives listed vertically. One 
of the altetnatives must be chosen unless one of the choices is under- 
lined designated it to be tlie choice by de&ult. 



underline — specifies default alternative. 



I vertical bar — separates alternatives. 

Used in simple definitions. 

< > Ckuner brackets — encloses English language used to describe or name 
a source language or intermediate language element^ structure or speci- 
" fication. . ..-^^ 

VL <i> the number, i, inside the comer brackets specifies the number of con- 
secutive occurrences of the unit, u. 

• . • repetition symbol — specifies that the immediately preceding unit 
(dement^ structure, or bradc^ed group) may occur a number of times 
in succession. 

XX. . . metalanguage literal — source language word of two or more characters 
with all letters capitalized, or else a source language spedal stmibol 
(character odier than a letter or digit). 

hx metalanguage literal specifying one source language character. It is 

specified by a lower case letter, h, subscripted by the character being 
specified. K'the character is a l^ter, it must be shown as a capital letter. 
When no possibili^ of confusion or ambiguity results, the source 
language symbols can be written as they occur without subscripting an h. 
The only exception known at present is the repetition symbol (. . .) 
which is a le^ symbol in the Jovial language. 

Specification of this symbol for Jovial will be by use of the subscripted 

X single capital letter — metalanguage notation for a generic term specifying 

a memb^ (any member) of a set of multi-element information UDoits such 
as statements (specified by the capital letter, S), and eapressions (specified 
by Ae letter, E). 

Xx. . . cqntal letter followed by one or more lower case letters — specifies a 
multi-element member of a subs^ of the set spedfied by tbs capital 
letter. For exano^le, Eln is the generic name for a member of the subset, 
L(^cal Numeric Egression, 



jle lower case letter — metalanguage notation for a generic term 
specifying a member of a set of primitive single element information 
units such as operators (specified by the lower case letter, o) and 
variables (specified by the lovra case letter, v). 

\ Xu>, \ * ' " -snbscr^ted lower-case letter — metalanguage ^generic terni for "a jprimi- ^ 
tive sii^e element which is a member of the si&s^ x^. . . of the primitive 
element set specified by x. For example, oi is the generic term for a logical 
operator. The set of logical operators is a subset of the set of all operators 
is a subset of the set of all operators, 0. 

Because there are so many subsets of identifiers, the major subsets have 
been assigned letters without subscripts in order to further reduce the 
dutter. Examples include i for int^er, w for reserved word, v for 
variable. These latters are subscripted as above to form subsets. 
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TABLE 11: Definidons 
Used as Element Used as Subscript 



Examples 



b 

c 



e 
f 



g 
li 



k 
1 

m 
n 



address label 



blank 

control indetifier 

data attribute 

element 
file name 



group (alternative) 
holletith (literal) 



index variable 
Giiteger) 



constant 



data element 
identifier, label, 

nntne 

operator 



parameter 



non-numenc ma 
(alphanumeric) 

arithmetic Ona 

address Oa 

binary Vb 
condition 

complex kc, Vo 

dimension dd 

decimal (fixed pourt) Ud, kd 

external ui^)acked Ve 

flag af 

prefix Out 

floating-point kt, vt 

function Ofn 

hoUerith cn, Oh 

hexadecimal Vb, kh 
inverse indirect address) oi 

fixed point vi 

instruction Owi 

conditional variable v^ 

logical Ob vi 

numeric mn, On 

optional Owo 

of&et ao» Oo 

pointer ap 
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- ' TABLE 11: " Definitions (doxitmued) 
Used as Element Used as Subscript 



Examples 



u 
w 

V 

z 



next instruction 
address 



picture 

relational 

statement 

sign 

type 

unsigned 
reserved word 



variable 

unknown element 

TABLE 12 

(a) Information Element Sets 

= <element> 
= n I 0 

n = <identifier> 

o = <operator> 

(b) Identifier Subsets 



dp 

Orel 

Be 

OfB 
Ow 



a == 



<identifier'-name, label>-operand> 
a I c I d I m 

= <address identifier — statement labd, filc> 
c = <control ideniifier> 
d = <data attribnto 
m = <data idcntifier> 
8t I ao I as I f 1 1 1 ap 
af = <flag labd> 
ao = <ofl&et address> 
ap = <memory address (painter)> 
as = <statenient label, subroutine name> 
f = <file name> 
t = <next instruction addrcss> 
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TABLE 12: (b) Ideixti£br Sid^sets (continued) 
c = b I Cw 

b = <blank> 

Cw == <reserved word nsed for control> 
d = ddjdp|dw|dt 

dd = <dimensi(m> 
dp = <cpictiire!> 

dw == <reserved word spedfying attnfaixte> 

dt =<type> 
m = ma I nin | Vo | vi 

ma = <non numeric data> 

mn = <real numeric data> 

Vc = <con^lex variablo 

Yi — <logical variablo 
ma = kwa I mc I mn I Vk 

k^a ~ <non-numeric figurative oon5tant> 

mc = <ccmditiQa name> 

mn = <bollerith word> 

Vk = <conditional variablo 
mn = i I j I kn I Vn 

i = <inieger variable, index> 

j = <int^;er variable, indea:> 

kn = <real numeric cQnstant> 

Vn = <real numeric variable> 
kn = kb I kd I kf I kh I kwn 

kb = <binary constant> 

kd = <decinial (fixed point) oonstant> 
j^:v ^ - ' <fldatiiig^ikiim - mj^.^ 

kh = <h£zadedmal constant> 

kvn = <nunieric figurative constant> 
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V - TABLE 12 : :(b)-Idaidfier Sid>sets (continued) ' 

Vn == Vb Vd I Ve I Vf I Vh I Vj 

Vb = <binary variabk> 

Vd = <dedmal variable> 

Ve = <extamal unpacked numeric variable> 

Vf = <floating-point vatiable> 

Vn = <biexadecimal variablo 

vj = <fixed-point variablO 

TABLE 13; Operator Subsets 
o = <operat3or> 

= Oa I Oc I Ql I On I Ow 

Oa = <address operator>. 
Oc = <control operator > 
oi = <logIcal operator> 
On = <numcric operator> 
Ow = <reserved word operator> 
Oa = 0| I Oo I Op 

Oi = <mverse address > 
Oo == <ofl&et> 
Op = <pointcr> 
Oc b I Oca I Ocn | Ot 

b = <blank> 

Oca = <non-numeric control > 

Ocn = <numeric control functions > 

Ot = <go to next instruction> 

Ol = Oip I Oir 

_ , oip =^<logLcalprefix,XNPT)> 

oip = <logical relation (AND, 0R)> 

On = Ona | Onf | Orel 

Ona = <arithmetic> 
ouf = <numeric prefix> 
Orel = <relational> 
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TABLE 13 (cogfeued) 

Oni = Ofn I OfB 

Of n = <iiiimeric functioii> 
OfB = <sign> 

Ow = 0^1 1 Owo 

Owl = <r€served word inst]:uction> 
Owo = <optional reserved word mstruction> 
Alternate Groupings 

Of = <prefix operator> 

= Ofn I OfB I Qlp 

Operator Subsets 

Oa = Oi I Oo I Op 

Oc = b I Oca I Oca | Ot 

Oca = <coiicatenation> | EOE | , 

Ocn = <function code> | <si]bscript code> | ( | ) | , 

Of = Ofn I OfB I Oip 

Ofn = SIN I COS I SQRT | <other £uiicdoiis> 

OfB = + I - 

Oi = .1. 

Oi = Qlp I Oir . : ;,vr 

oip = NOT I .NOT. 

oir = AND I OR I JVND. I .OR, 

On = Ona | Ofn | Ofs | Orel 

Ona = + I - I <X> I <-r> | <exp.> 

Onr = Ofn | Ofa 

Oo == -O. 

Op = JL 

Orel = ^i<i = i7^r>i>h<n=n< 

Ow = Owl I Owo 

Owl == IF I GOTO I DO I <any other reserved word instruction> 

Owo == WITH I THEN 1 AT I ADVANCING <any Other optional 
reserved word > 

Ot = <go to next instruction 
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TABLE 14: In&nnadon Structure Sets 

(The label on the left represents a member (any member) of the set 
whose members meet the requirements specified on the right) 

A = <storage structures> 

C = <operating system routine> 

D = <set of dataattribmes> 

£ = <e^ression ^ in IL — 1^ it. consists of an alternating sequence 
of idoitifiers and cperators> 

F = < Channels, hardware interrnpts, hardware indicators such 
as condition and overflow flags> 

6 = <Group composed of assorted structures £rom various subsets 
such as executable program statements, control instructions^ 
expressions, elements. > 

H = <Library> 

M = <data structnre> 

N = <identifier dictionary (symbol table)> 

O = <set of operators> 

P = <procedure, program, sub-routine> 

W = <reserved word dicdonary> 

S = <statement» induding delimita:> 

T = <decision table> 

V = <vector — n — dimension variablo 
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TABLE 15: Infonnation Structure Subsets 



E = 



£1 
Bin 



<expFession (parenthesis must be paired)> 

[of. , .] [{. , .] n [o [(. . .] [of. . .1 [(. . .] nD. . .]] 

£]a|Elii|Hra|Hni|£a|Eii|£fiL|£c|El 
Ela I Eln 

<logical numeric (decimal) e^ression> 



fvi oi 1 

~ LEra / \Em J 



Em = 



En - 



Ela = 



Era- = 



Ea = 



<relatiotial numeric e^pression> 
En [or En] . . . 

<numeric (arithmetic) e3q)ression> 

[oiif][(. . .]mn |^ona[(, . .] [onf. , .] [(. . .] mnQ. . .[^ . 
<Iogical esqyression of a]phanumerics> 

tEra J [ol tEra J ]. . . 
<re]ational es^nressions of alphanumerics> 
Ea [or Ea] . . . 
<alphanumeric expression> 



of 






[(. . .]ma 


oa 





{oca l*^^ 



J [of] [(. . .] maD. . .] 



Efii = [Qf]afiiEn 



Ec " = 



n.plnd 
vkol kii ' 

{mc }j 
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TABLE 16: Informatton Structure Subsets 

Groups 

G = <group composed of assorted structures ftom various subsets 
such as executable program statements, control instructions, 
egressions, elements. > 

= Gt|Gf 

Gti = <mstruction to be executed if conditional expression, £q, is 
true. If Gti is executed, Gfi is by-passcd> 

= ■ • ]. {!*■ } 

Gf| = instruction to be executed if conditional expression, Eci, is false. 
= [s.. . . ] } 

<the subscript, i, denotes the ith leyel of nested IF statements > 

ot = <sped£Les that tiie next instruction to be executed during 
program execution is the statement which follows the entire 
'IF* statement including all nested *1F" statements. 
It is an instruction to the compiler to generate coding as 
needed to carry out diis instruction execution sequence> 

TABLE 17: Information Structure Subsets 

Data Structures 



M 




<data structure> 


m 




<sin^ identifier of data (memory) > 


Ma 




<array of data> 


Mf 




<data in a file> 


Mg 




<a generation of hierarddal data> 


Mh 




<hiefarchy of data> 


Ms 




<a string of data> 


Mt 




<table of data> 
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TABLE 18: Infoimation Structure Subsets 

Statements 

S = <statement (indudiDg deliimter)> 

= Sc|Sd|Se 

Sc = <amtrol instructicm> 

Sd = <data dedaration statemeiit> 

Se = <executable statem£nt> 

= Sb I Sx I Si 

Sb = <uncondirional branch> 

Si = <IF stateinent> 

= IF Ec Gt owi Gf 

Sz = <ai^ executable statement except an unconditional branch or 
an IF statement> 



Table 14 shows die assigomeot of upper 
case alphabet letters to the major subsets in 
Mttalanguage. Table 15 defines the major 
5 subset "expiessicsis'' in terms of the next 
lower levd in the Metalanguage xu>tatioa 

Table 16 defines the major . informadoa 
subset "G" (groups) and how it is repre- 
sented in Metalanguage. 
10 Table 17 shows the asagnmeitf of suffixed 
cajutal letters to specify die various major 
subsets of data structures in Metalanguage 
notation. 

Table 18 shows the assignment of labds 

15 to the major types of statements whidi be- 
long to the set "S" (statements), anodier 
major infoimation structure subset 

The Metalanguage devdoped herein is 
organized so as to define a hierarchy of cotb- 

20 ccpts (useful in programming and computers) 
which are organized accoiding to levds of 
complexity and sojAisticatioa so that there 
is an easy lecognidon of the of^»?oximate 
hieraidiial levd of each ccmcept 

25 For exanqyle, with reference to Fig. 6, cer- 
tain useful informadon imits of various com- 
plexity are ^own in hierarchial rdadonship 
to each other by the line-tree drawing of Fig. 
6. The P. represents a program, procedure or 

30 ' set of programs. Now smce any statement is a 
member irf the set, S, of all statements, and 
sance;, S, is a siAset of P> S, is linked to P. 
Odier subsets of P are not ^own. Now since 
statements contain expressions, there is a fur- 

35 thex Knk to E, wiiidi represents expressions 
showing that E, is a subset of S. Since ex- 
pressions are cwnposed of dements, diere is 
a further link to e, which represents dements. 



The set of all dements breaks down into the 
subsets: identifiers, n; and operators, o. 40 

Hie identifier set^ n, may be subdivided 
into various subsets as shown by small letters 
a,b,c,f, g,h,i, d,m and the variable v, 
each of which can be further subdivided as, 
for example, the variaWe v in Vi» Vo v^, Vf, 45 

Vi, Vr- 

Other subdivisions of the dement set, e, 
mi^t be represented by n^, and etc. 

The operator set^ oi, may be subdivided as 
seen in Fig, 6 to o^^ . . , etc 50 

Fig. 7 shows a further organizadcHi of 
hierarchies of infonnadon in' the Metalan- 
guage. In Fig. 7 the set^ C, mi^t represeni 
the generic term for any member of the set, 
"control or oj^^ating system"; the letter M, 55 
might represent the generic term for any mem- 
ber of the set "data structure^*; the letter A, 
may represent an area of storage; the letter 
P, may repanesent a program; the letters S, G, 
and T, \^uch link to P, represents members 60 
of various sections or pieces (subsets) of the 
program P. Each of these symbols is furdier 
broiken down into organizations involving less 
ounplez and less complicated bits of inf ozma- 
drai. 65 

Table 19 diows a COBOL ,*;iF" statement 
^tedficadcHi whidi is written in IBM nota- 
tion. It should be noted as a cnmmwtr that 
some of the spedfications for sosnc of the 
teams in COBOL are never expMddy spoci- 70 
fied and defined in the reference manuals, as 
for example the term "NEXT SEN- 
TENCE". 

Table 20 illustrates die COBOL "IF' 
statement spedficadon as written in Backus 75 
Normal Form (BNF). 
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TABLE 19 

CX)BOL "IF* Statement Spedfication — Written in IBM Notation 



IF aridim^c — expression —1 IS [NOT] ^ EQUAL TO 

[7 



fGREATERTHAN 
EQUAL TO 
LESS THAN 



arithmetic — easptession —2 [THEN] fstatement— 1 . . . "] 

^ NEXT SENTENCE V 



{ELSE \ f statement-2. . . \ 

OTHERWISE / \NEXT SENTENCE /. 



Note: specifications for some of the t3eims are buried in the text of the 
manual OthcrtetinssuchasNEXT SENTENCE are never espfidtly speci- 
fied. 

TABLE 20 



COBOL *TF" 


Statement Specification — Written in BNF 


<symbQl> 




IF <arithm^c e3qpression> IS 


<symboI 1> 




<symbol> | <symbol> NOT 


<8ymbol2> 




<symbol 1>GREATER THAN | <symbol 1> 
EQUAL TO 1 <symbol 1> LESS THAN 


<symbol 3> 




<symblo 2><aritfmietic expression> 


<symfx)l 4> 




<syniboI 3> | <symbol 3>THEN 


<symbol 5> 




<symbol 4><e»ecutablc statement> | 
<symbol 4>NBXT SENTENCE 


<symbol 6> 




<symbHDl 555^ ELSE | <symbol 5> OTHERWISE 


<COBOL IF> 




<symbol 6><executable statement> | 
<symbol 6> NEXT SENTENCE 



} 



Table 21 shows the COBOL "IF" statement spedfication as written in the newly 
developed Metalanguage. 
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TABLE 21 

COBOL "IP* Statement In New Metalanguage 



nSLSE 

IF Boa, IS [ot] Orel Etta [THEN] Gt< V Gf 

^OTHERWISE 



} 



\dieTe: 



Gt = 



{!} 



ot 



[Sx. . .] 

<go to next statement following Gf> 
Gf = [Sx. 

<go to nest statement following Gf > 



-i {!} 



Ot 



Figure 10 shows gaierically how Ae "IF' 
statement is repn^ented in Metalanguage. 
Hie rectangular blocks refotsect "strings of 
5 infonnatioa units". Si is a geaeric statement 
of "IF" and compares to the Table 21 state- 
ment. 

In Fig. 10, an infonnaticm unit such as 
Eln is connected in terms of lower order 
10 hierarchies of information as shown in the 
Metalanguage aotatioUr 

Use of Metabngaage Foomat 
The new Metalanguage prcmdes a means 
of specifying the kif ormation content 

15 of various source languages as well as the 
first Intermediate Language^ IL — 1, and the 
second Intermediate Langimge^ IL—2, in a 
systematically labled commoa nfltatioa vdiich 
defines informanon vsdis in a precise, concise 

20 way. Therefcwe, identical infoxmadoa units 
can be easily identified In addition, the hier- 
archial level of infannati(H> units (degree of 
complexity or quaniSty of inf onnatiofn) can be 
recognized without knowing all the demea- 

25 tary details of the infoimatioia imit involved. 
As a result of this tool (the new Meta- 
lai^;uage) smnre language statements in vari- 
ous languages can be written m the new 
Metalangu^e for study and comparison to 

30 detennine similarities and difEerences in the 
v ^funcOMis provided by the various source lan- 
guagesy so that the mtennediate la^oage 
functioDS can be designed in an efficient man** 
ner. Without this Metalanguage, it would be 

35 necessary to jdace a heavy relim» on remem- 
bering all the complications, rescricdonst, and 
specifications for the inf ormadoo units in each 
of the source languages for a statement being 
studied. In addition, if this statement were to 

40 be reccmsidered at a later lime, one would 
again have to refresh his memory on each of 
the soiHce languages. 



As a result ci the work cm devd<^mig in- 
termediate languages^ parameters have thus 
been identified and organized at various levels 45 
of an information hierarchy as fdlows: 

(a) At least 15 major infonnatioa clas^ 
have been established for lai^ muld-element 
informatioa units such as statements (S) and 
expressions (E), 50 

(b) At least 35 major subsets of the above 
groups have been established — as midti- 
dement units sudi as logical expces^ons (El). 

(c) 18 primdpal primitive types have beai 
^tabli^ed — as sets of sm^e eJefments of 55 
infonnadon such as operators (cf), vaiial^ 
(v), etc 

(d) 44 suteets of OT^e dement groups 
have been e^ai^^ied - - of which examples 
inchide types of o{Krat[>r5 sudi as arithnietic 60 
operators, Ic^cal operators^ -and control oper- 
ator. Reference may be made to Figs. 6 and 

7 in this r^ard. 

Chae of the discoveries made in regard to 
the devdcqxment of the mformation hierardiy 65 
and the new Metlangiiagp notation may be 
illustrative of die discoveries involved. In the 
early stages of this work, all ^'reserved 
wards'* Qndnding instructions) were classified 
as "identifiers". Also^ "iiBtructians*' were 70 
considered identifiers of call routme& After 
experience and work with these original 
dassificatioos, it. was- found* from-^ experience > ^ 
that those "resraved words" used as instruc- 
tions or commands and "instructions" shoidd 75 
be designated and defined as "operators" 
rather ^\an as identifiers. 

Hereinbdow will be found a group of 
tables vMdx illustrate how various statemems 
frcm major source languages can be written 80 
in the new Metalanguage notation and how 
they tr ansform into statements in tiie first and 
second intermediate lai^uages and how they 
may be mapped. 
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The Mowing tables, taWes 15— 24 will which "ait fwirid- "in hi^ level kngi^gcs m-^ 

show ±e represeatatioa of various COBOL the left cohnnn, \iiiile the equivaleat ftmcticm 

and FORTRAN ftxnctiiCHis in Metzd^girage in IL— 1 format is ^own in the rig^t hand 

notation: column. 

Table 15 shows repsesuitadve functions 



Representative Functions 
Found In Frograms 



TABLE 15 



Rq>resentation in IL~1 Format 
Usmg Metalanguage Notation 



LogivallF 
Relational 



IFElGtGf 
IFElGtGf 



Sign (COBOL) 
Sign (FORTRAN) 



Condition 
On Count 

Unconditional Brandi 
Alter (assigned Branch) 

Computed Branch 
Iteration 



IFElGtGf 

IFS En Si Sg Ss 

IFCLnndGtGf 

IFCfvkgGtGf 

ONKiiKiaKisGtGf 

BR as 

ALTER as-i as 

GO as, asl, as2, ... 

GOTO i, asl, as2, ♦ , . asj 

DO cl al a2 vnl mnl 
mn2 mn3 Ell [a3 a4 

vn2 mn4 mnS nm6 

ELJaS a6 vn3 mn7 moB 
mn9mj] 

ASGNMT ow crd M E Se nm4 



Table 16 ^ows the Condidooal Branch 
(IF) of source language FORTRAN and 
souice language COBOL in Metalanguage 
notation. Table 16B for COBOL shows suiv- 
15 portions (a), (b), (c) showings in Me^anguage 
notation, three forms of conditional Inanch 
(IF) tests. 

Table. 16 subsection (d\ ^kvws the single 
IL— 1 statement into vf):ack everyone of these 



FORTTtAN and COBOL statements map. 20 

Subsection (e) Ksis the GeneraliTed Con- 
piler tasks for thes omditional brandi (IF) 
statements* 

Subsartion {f) of Table 16 shows a typical 
porttcm of the conditional branch IF state- 25 
mcnt in IL — 2. 

Subsection (g) illustrates how the condi- 
tional branch IF statement is mapped. 
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TABLE 16: Condidotial Bianch (IF) 
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(oonmared after glignmpnt of decimal points if all elements numeric) 
(A) FORTRAN 



IF(£ln) St 
(B) COBOL 



(a) LOGICAL "EBST 



IF El [THEN] Gt\OTHERWISE/ Gf at 
(b) RELATIONAL TEST 



> 
< 

GKEAimTHAN 
EQUAL TO 
LESS THAN 



IFBiIS[NOT] -{EQUAL TO J^EaCTHENGt . 

(c) SIGN TEST 



fELSE 

OTHERWISE 



Gfat 



}< 



fPOSmVE T fELSE 
IF En IS [NOT] -I ZERO J^[THEN] Gt -( 5^Gfat 
^NEGATIVE J l^OTHERWISE 

(d) n^i 

IFiEIiGtiGfi 

at 

I 



I 

(e) G. a TASKS 

<symbol table look-tq) of atdibutes of all elements > 

<oode to align dedmal points if atdibutes of all elements are 
■ \j nuineric> " • -■ - • . r ■ . ' ' 'i'- 

<reaiiange expression according to precedence assigned to 
operators >> 

<generate code to compute tiie value of the e^ression> 

<code to test logical value> 

<code to execute Gt and skip Gf if test result trtte> 
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<cade to execute Gf)> 

<repeat above steps for each IF statement> 

<repeat . > 

<die statement to be executed > ot < is the sta t em ent 
Mowing the last nested 'V statement In the n^l fonnat 
it is at > at <Nested "IP* statoncnts can appear in Gt or 
Gf> ' 

(f) 11^2 

a 0 n [n] 

(g) Conditional Branch (IF) 



MAP 



SOURCE 



MAPS INTO 



St 



Gfi 



NEXT SENTENCE 

<Ccgecntabl e statem ent . . . 
followmg THEN or 

El and H^^iiTnftwl by ELSE or 
OTHERWISE or 

a fceywdcd designating a new 
insmicdon> 

<executable statement . . . 
following ELSE or 

OTHERWISE dehmited by 
tihe start of a new instruo- 
tion> 

EiOrelEs 

Enord O.O 



= Gt 



rsb -) 

= [Sx. . -] -| Sii+i I 
= [Sx. • .] |s?i+i I 



= GOTOt 



=:Gt 



= Gf 

.=.E1 , 
= E1 
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- '^^^ " TABIJE 16: (g) Gt^ Ctmdmjed 

SOURCE 
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MAPS INTO 
(INIL— 1) 



GREATER THAN I 

EQUAL TO I 

LESS THAN! 

NOT GREATER THAN | 

NOT EQUAL TO I 

NOT LESS THAN| 

fposrrivEi 

ZERO I 
NEGATIVE I 
NOTPOSmVEl 
NOT ZERO I 
NOT NEGATIVE | 



< 

1> 

1= 

1< J 
>o 
= o 
< o 

l>o 



^ = Orel O.O 



TABLE 17 shows die FORTEUVN Sign Test stated in Metalanguage 
(a) and the equivalent statemoits in IL—I (b) and in IL— 2 (c). Sub- 
section (d) of Table 18 shows the mappimg. 

TABLE 17: Sign Test 

(a) FORTRAN 
IF(En)Si,S8,S, 

(b) H^l 

IFS En^ Sj9 S3 

(c) 11^2 



<check symbol table — all elements of E must be nuoo^o 
<oode to compute value of £n> 
COAtPEnO.O 
BRLSi 
BRES. 
BRGS, 
(d) MAP 



51 = <nextstatementtobeexecntedifthevalueoftherelational 

expression is n^tive.> 

52 = <next statfiment to be exeaited if the vahie nf th^ r pJi^'ni^a l 

e^nession is zera> 

53 = <negt statement to be executed if the vahie of the relationai 

espression is positive, > 
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Sknitoty to Table 17, table 18 shows th!c COBOL Class Test; Table 
19 shows the Condition Test for COBOL; Table 20 shows the COBOL 
function of Conditional Branch on Count; Table 21 shows the COBiOL 
and FORTRAN Unconditional Branch functions; Table 22 illustrates the 
COBOL and FORTRAN Alter functions. 

TABLE 18: Oass Test 

COBOL 

fNUMERIC T 
IFnIS[NOTl i V[THEN]Gt 
LALPHABETIC J 

IL-1 

IFCL n, nd, Gt, Gf 

11^2 

<Gode for character by character table look-up of dass specified 
bynd> 

BRE Gt 

EXEC Gf 

TABLE 19: Condition Test 

COBOL 



IF vk IS [NOT] kn [THEN] Gt 




faf ^ fELSE 1 

IF[NOTI^ ^|THEN]Gt ^ V G£. 

\mc J [OTHERWISE J 

n^i 

IFC f, ^ g, Gt, Gf 

D>-2 

<look up in symbol table to determine if mc or af or kn> 

<code to find mc corresponding to knif T*spedfied> 

<code to address and fetch the conditional variable fidd ^ in the. 
record currently in the file location,. f> > ; - 

<compare> vk g 

BREGt 

EXECGf 



fELSE 

Gf 

[OTHERWISE 



} 
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TABLE 19: Ccnrfiti(m Test QHtttmied 

MAPS 

where: g = mc | af | kn 

kn = <piogram siq)plied test vahie specifying one of the conditiQn 
names which die conditional vanable may represent > 

vk = < conditional variable (source program translator generates name 
and assigns a value of 1 to it in the case of forms overflow 
test)> 

<transktor supplies the file name^ f, of the file curiendy open. > 
TABLE 20: Conditional Branch On Count 



COBOL 



ON Ml [AND EVERY ki2] [UNTIL Id3] 



} 



fELSE 

Gt[ ^ yof] 

l_OTHERWISE 
IL-1 

0Nkilki21d3OtGf 

COMPILER TASKS 

<assign name to counter variable;, i add name to symbol table 
and set its value to 2ero> 

<de&ult ki3 — infinity. For both ki2 and ld3 blank (execute Gt 
once) 

defeult ki2 = kil, ki3 = HI + 1 > 



11^2 



i=i + l 

IFi^kl3GOTOa4 
al IF(kil~i)a2,a3,a4 
a2 kil = kil + ki2 

IF(kil— ki3)al,a4,a4 
a3 Gt 
a4 G - 

a4 Gf 
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TABLB 21 : Unconditional Branch 

FORTRAN 



GO TO artmt 
COBOL 



fapara 1 
laeec J 



GOTO 

BR as 
11^2 

BR as 

MAPPING REQUIREMENTS 

SOURCE n^i 



aetmt 
Apara 



TABLE 22: Alter 

(a) FORTRAN 



ASSIGN Betmi^ TOaatmt 
GO TO (aetmt i' astmt 2'- • 0 



(b) COBOL 



ALTER ^apsri^-i TO PROCEED TO 



GO TO [aparal- 
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TABLE 22: Aker. Cantmued 
l^ALTER as-i, as 

-^GO as asl a32 

<check that as-i in tlie ALTER statement is a legal member of 
liie set in the GO statemeno 

<look up in symbol table the address (loc2) of the branch to as> 

<Gode move instruction at location 1 to location 2> 

<code to branch past location 1 > 

loc 1 : <code to branch to as-i> 





loc 2: <code to branch to as> 
EXAMPLE: 



MVAR$1,$2,4 
BR $1+4 
$ 1 BR as-i 



$2 BR as 
MAP 

SOURCE EL— i 

afltmtri as-i , 

astmt as 

aparaJ as-i 

asec as 

apara as 
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If apara is omitted in die COBOL statement^ the GO TO statement must 
have a paragraph name > by the only statement in the paragraph and be 
modified by an ALTER statement before 1st execution. 

Table 23 iOustrates the Computed Branch funcdons for FORTRAN 
and COBOL, which^ as the above, indudes mappii^ requirements. Table 
24 shows some of the Assignment Statement Varieties in FORTRAN (A) 
and COBOL (B), followed in sequence the statements in IL — 1 fiirmat 
(c), the Generalized Compiler ta^ (d), the IL— 2 format (e), and the 
mappmg®. 

Subsection (h) of Table 24 shows mapping details appHcable to specific 
COBOL source statements coEtesponding to subsection B as related to 
IL — 1 format 



TABLE 23: Computed Branch 

FORTRAN 



GO TO (astmUa aatmlE* - . -)* i 

COBOL 



}r,aparB2 -t 
Ljaseca J- 



GO TO -{ y \ I DEPENDING [ON] i 

I^-l 



GO TO i, asl, as2, . . .asj 

IL— 2 



BRl 4-* 
<i = l> BRasl 
<i = 2> BR as2 



<i = j> BR asj 
Mapping Requirements 

In COBOL, if i>j, the GO TO statement is ignored and control passes 

to next sequential statement 

* designates current instruction address. 

SOURCE II^l 

astmt-i as-i 



apara-i 
asec-i 



as-i 
as-i 
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TABLE 24: AssigDjnait Stacement ^ 

(A) FORTTIAN 



(1) vl = El 

(2) vn = En 
(B) COBOL 



(3) COMPUTE ^ I [ROUNDED] 
; [ON] SIZE ERROR Se 



Lkn 

(4) ADD mn2 . . . TO mnl [ROUNDED] 



[ON] SIZE ERROR Se . . ^ 

(5) ADD nm2 mn2 . . . GIVING mnl 
IROUNDED] Q [ON] SIZEERRORSe . . , j 

(6) ADD CORRESPONDING AIn2 TO Mnl 
[ROUNDED] Q [ON] SIZE ERROR Se . . . j 

(7) SUBTRACT CORRESPONDING Mn2 
FROM Mnl [ROUNDED] 



[r[ON] SIZEERRORSe . ^ 



(8) SUBTRACT nm2 . . . 



fvnl [GIVING vnl] 1 
FROM ^ V 
l.kn GIVING vn J 

[ROUNDED] j^; [ON] SIZE ERROR Se . . . j 

(9) SUBTRACT nin2 FROM nml 

[ROUNDED] Q [ON] SIZEERRORSe • • • ] 
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TABLE 24: Assigamfnt Statement Contimied 

(B) COBOL 

fvnl [Giving vn] 1 

(10) MULTIPLY mn2 BY ^ Y 

Lfcn GIVING vn J 

[ROUNDED] [ON] SIZE ERROR Se . . . j 

fvnl [GIVING vn] ^ 

(11) DIVIDE mn2 INTO ^ 

l^kn GIVING vn J 

[ROUNDED] [REMAINDER mn4] 
[ON] SIZE ERROR Se . . • j 

(12) DIVIDE mn3 BY mn2 GIVING vn 
[ROUNDED] [REMAINDER mn4] 

[; [ON] SIZE ERROR Se . . . ] 

n^i 

ASGNMNT ow crd M E Se mn4 
G. C TASKS 



<process e^)ression3 E, to transform it fix>m algebraic notation into 
an operation sequ^ice based on standard precedence rules > 

<tiie synibol table is provided with the updated length of all ex- 
pressions, E, and executable statement grot^, Se> 

<if the source language precedence rdes are not standard, the 
source language translator must add parentheses as needed to 
convey the correct precedence information> 



11^2 

a [n] 0 n 



MAPPING 
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r-^-^v TABLE 24: Ass^nment Statement OMitiiiued 
GENERAL 

SOURCE LANGUAGE IL— 1 

vl M 

vn M 

mr M 

vnl <with vn blaiik> M 

mnl M 

=E1 E 

=En - E 

=kn E 

ROUNDED set crd =1 

ON SIZE ERROR Se. . . Se 

eletnent expressioii ow = 0 

ADD CORRES ow = 1 

SUBT. CORRES ow = 2 

CONCATENATE ow = 3 

SEPARATE ow = 4 

AtQi <=mnl-l . . ,> M 

Mna <= mn2-i . . . > E<Ei [o Ei] . . 
SOURCE STATEMENT 
NO 

(4) iim2[+ mn2] . . . -f mnl E 

(5) nm2{+mn2} ... E 

(8) ^ I {-mn2} ... E ^. 

(9) mnl — mn2 E 

fvnl 1 

(10) mn2*j^ I E 

(11) I /mn2 E 
Lkn J 

(12) mn3/mn2 E 



53 



U67,741 



53 



In the precedii^ sections involving descrip- 
tion, tiieie has been presented a computer 
system which is enhanced by an optinxal use 

5 of a gmeralized comipilesr. Fig. 1, shows a 
flow diagram whereby the source program 
in high level language is commumcated to a 
souroe language translator, SLT 10^ wherein 
ihcfie is a sq>araiioai of non-excxrutable infor- 

10 matfoQ and executable statements (in the 
EL — 1 language fonnat). Then th^ non- 
executable information is mapped into the 
Symbol Table 13 of die Generalized Com- 
pOer 11^ and the executable statements are 

15 communicated to Problem Transfomiation 
Module 12, Subsequently^ diis information is 
communicated from the Geoerah'zed Com- 
piler 11 to the Madiine Code Translator 
MGT 14 ^ch provides an output of the 

20 source program in jMimitive operation se- 
quence code. 

Fig. 2 illustrates a more ^x^fic embodi- 
mmt showing how eadi hi^ level source 
language (such as CQBOI^ Fortran^ etc.> is 

25 provided with its own qjecific soarce language 
tramslator (SLT) respectively, such as 10^, 
lOf, and ICL. Further, Fig. 2 also shows that 
cadi ho^ computer, such as No. 1, No. 2, and 
No. 3 is provided with its own individualized 

30 program package (MOT) deagnated as I4^ntu 
14m2, and 14ntt. 

Figures 11 through 14 show various cm- 
figuratio^ ol the system which can be used 
to suit particular requirements. 

35 Fig. 11 ^ows a computer system configura- 
tion using the GC for the conventional, as- 
rent procesang sequemoe of compiling a source 
program into an direct pFogranx in machine 
code for a specific cunent type machine^ the 

40 object program having been processed to the 
extent that it is ready for die computer to 
link, load, ^d execute the program to pro- 
cess die data input supplied at nm-time. 
When said object program is not beii^ 

45 to control the pioccssmg of data, it is scored 
on cards, t^, disc file;, or other secondary 
storage media. In Fig. 11 a sequential ^stem 
of pit^ram (^)erations is shown vdiearein the 
source pn^ram 30 is operated on by the 

50 Source Langirage Translator 10. The pro- 
gram, in EL — 1 fonnat, is then operated upoa 
by the Generalized Cooqwler 11 having an 
output program in IL^2 format. This pro- 
gram is processed by MOT 14 to provide an 

55 object program 42 which is in machine code 
for a* spec^ host ccHupucer. Data, as repre- 
sented in block 35, is combined for opera- 
tion with the object program 42, this c^ect 
program being the source program stored on 

60 cards, tape, or disc at times occunii^ between 
exeoitioas of the object program. Following 
this is the run-time machine code execution 
roittiiies designated as 44 which provides the 
output designated as Uock 45. 

65 This configuration replaces the standard or 



oonventibdial ccraipiler functions' great ^ 
amounts of memory and computer time are 
used to accranplish cosninlation by means of 
a separate compiler for each source language. 

In Fig. 11 block designated as die o^ect 70 
program 42 is the output of the compiler 
which can be stored on cards, tape, or disc 
with no necessity for actual execution of the 
program. In other words^ it may be hdd and 
stored for later execution as in oonvendonal, 75 
current computer systems. line 39 indicates 
that, si:rt>sequently, at some convenient time, 
the object program 42 may be utilized for 
execution in combination with the data block 
35. 80 

Fig. 12 shows a computer system configura- 
tion uang the Generalized Compiler in '^ch 
the source pr<^ram is processed into an 
object program and a symbol t^Ie at* the 
IL— 2 level These are stored in secondary 85 
storage until the program is to be executed. 
In diis configuraticMi, the IL^2 is die imple- 
mentation langtagp of the target or host com- 
puter \rfiich means that the computer has 
run-time interpretive routines which can exe^ 90 
cute the object pn^ram direcdy. Run-time 
interpretive routines perform the final trans- 
lation of IL — 2 into {Himitive madiine codes 
as necessary, and provide for type conversicm 
(for instance;> integer to floating point) as 95 
necessary based on data attributes furnished 
in a symbol table;, or as part of the input data, 
or in a data base. 

Object programs vsiiich arc in IL — 2 lan- 
guage are less vulnerable to d>solescence or 2 00 
diange than a conventional object program 
written in machine code for a ^>eci&: mach- 
ine. Object programs in IL — 2 can be used 
on any of a class of machines without recom- 
{Mling the source programs. RecompiUng X05 
would be necessary in the event - the new 
machine used a different implementation lan- 
guage. However, the implementation language 
does not change as often as madiine codes 
since die implementation language^ being no 
IL — ^2, is designed to be machin&^independent 
for a class of madiine. 

In Fig. 12 source program 30 is processed 
by SLT 10 and GC 11 into ari object pro- 
gram in IL — 2 desi^ated block 42. ITius, 115 
this object pn^piam in IL — 2 can later be 
executed in any caie of a class of computers. 
Usmg Rim Time Interpretive routines 43 
to process the block 35 as per the object pro- 
gram instnictiim sequence and the Symbol 120 
Table 13' (which may be in a data base), ari 
Output 45 of processed data may be eflS- 
dendy realized. 

Figs. 13 and 14 illustrate computer system 
ccmfigurations which differ from each other 125 
only in that in Fig. 14, a data base 36 is used 
rather thwi a Symbol Table as in Fig. 13. 
These particular computer systems (Fig. 13 
and Fig. 14) illustrate an ^^»roach to com- 
puter ardiitecture diat is not feasible without 130 
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' the Gaiei^ coocept^TKis^ ' 

proach visualizes the gqcml use of laige 
caauixon data bases and the eliminadon of 
data dedaratioQ from a large dass of source 

5 programs dieieby aynsiderably sunplifying the 
writing of source progranis. In conjunction 
with dte data base^ a separate compiler can 
be used to update the data dedaradooi infor- 
matioQ as needed 

10 Because the Generalized Compiler is de- 
signed to be iKcd with various source lan- 
guages making it a frequendy used part of 
the computer systm and because of the ad- 
vent of LSI technology i^uccng the cost of 

15 hardware-firmware^ it is feasible to consider 
implementing the various subiYNUines of the 
Geaeralized Qnnpiler m LSI chips. This 
use oi high speed circuitry could increase the 
ccHnpile speed by m cnder of magmtude 

20 dieteby doubling the total throAi^put of a 
computer system. 

In the ccmfiguration of Hgs. 13 and 14, the 
source prc^gram is translated by SLT 10 
(ndiich mi^t also be implemented in hard- 

25 waie^firmware) into an object program 41 in 
IL — 1 and a Symbol Table 13 (unless a data 
base 36 is being used). The ot^ect prc^ram 
41 and Symbol Table 13 are stored in secon- 
dary storage until esecutioin of the program 

^ is desired. During jwogram run-time^ die 
Goieralized Compiler 11 maps the source 
program statements frran IL— 1 iitto 11^2, 
mappmg as much as one Wock or procedure 
at a time. The run-time interpretive routines 

35 44 process or execute these IL — 2 com- 
mands in conjunction with the input data 
35 and data attribute information 
from the data base 36 or sym- 
bol table 13 producing program output 

40 44'. In case of i»ograms vrtrich^are to oe 
run many times, the output of the 'Cji^ieral- 
ized Compiler in IL — 2y nwcrocode, or other 
primitives, may be stored oo secondary stor- 
age as a new object program 45' v^ch will 

45 be used in subsequent executions of the pro- 
gram bypasang the Generalized Compiler. 

The Generalized Campiler-^ummary 
As a result of the language development 

50 and use to date, it is now possible to specify, 
with reasonaMe assurance, a basic general set 
of functions which can be used to describe 
the great majcsity of user programs being 
written today regardless of the source Ian- 

55 guage in which written. This makes possible r 
a new simplifying approach to cranj^er de- 
sign. 

Instead of trying to fit the compiling task 
into one elegant overall concept sudi as a 
60 Syntax-directed conquler or a ccHnpiier-com- 
piler, tiie system described herein has divided 
die task according to indq>endeiit parameters. 
Thus^, language trans]ati<»i is separated from 
problem transfonnation; in other words^ how 



" the 'jmibion is stated is separated from die 65 
pxiblem itsdf. In addition, the description 
of the data attributes and environment (data 
dedaratiott) is separated from the acticms to 
be perfotmed on the data (the executable 
statements). 70 

As a result, can^nler complexity has been 
reduced without placing new restrictions on 
the user. Instead of designing a oompletdy 
new compiler for eadi language for eadi new 
computer, all that is required is a new source 75 
language translator to translate frcnn one 
higb levd language to another vdien a change 
in source language is involved and a new 
translator to trandate from one machine-level 
code to anodier wiiai a new computer is 80 
involved. 

^ To implement this concept of separating 
problem transfonnation from language trans- 
lation. Two intermediate languages IL — 1 and 
IL — 2, were developed. Nonexecutable in- 85 
formation in IL is in the f cam of a symbol 
table. The executable input to the Problem 
Transfomiati(m Module is in hi^ level lan- 
guage;, IL — 1, which is source language inde- 
p^^ent llie executable output from the 90 
Problem Transformation Module is in 
machine-level operation code format, IL — 2, 
which is machine independent for a class of 

As a result, die Problem Transformation 95 
Module need not be red^gned for either a 
sourcfr-language or a computer chaE^. Be- 
cause of this design stability, the Problem 
Transformation Modide can be designed as 
a hardware-firmware unit thus reducing the 100 
work load of the CPU and the (^jerating 
system and increasing the speed of com|nliaig. 

As a further exciting posahiKty, the flow 
of jobs througji a cwnputer may be radically 
changed so that data attributes are stored as 105 
part of a common data base and are compiled 
and updated independentiy £rom executable 
programs by a specialized compiler which 
only compiles data attributes and e nv ironment 
information^, assigns storage addresses to data, 110 
and otherwise interfaces with the common 
data base. Users using common data already 
in the common data base do not have u> 
declare the attributes. Mixed data types are 
converted as necessary as they are fetched 115 
from the common data base during run time. 

As another possibility, the source language 
can be some conversational language which 
can be onnpiled on-line assuming the conver- 
sational language is a simple one-to-<»ie trans- 120 
lation into IL — 1 and assuming that the 
Problem TransfonnatiMi Module is a hard- 
wart-firmware tmit^ eq)edally if the source 
syntax recognizer is also impl^ented as fiim- 
ware. 125 

In summation, the compiler system de- 
scribed herein promises to make the com- 
puter function more manageable and thereby 
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-i, . ^to allcw more freedom and flexibili^. to dcr . 
vdop better computer systems and user lan- 
guages. 

WHAT WE CLAIM IS:— 

5 1. Data processing system v^en ccmditicHied 
by pfcgramming means \Mx^ includes trmis- 
lation means arranged to cause die data pro- 
cessmg system to translate a source pn^ram 
in a lug^ levd source language into an inter- 

10 mediate language which is a high level lan- 
guage indepqident of the source laiigua^, 
said translatioa means being effective;, in 
operaticm^ to separate executable statements 
(as hereinbefore defined) fnmi non-executable 

15 infosmadoii (as herein defined). 

2. Data processing system acconding to 
Qaim 1, wherein said non-executable mfor- 
maticm is stored in tabular form in a symlx^ 
table. 

20 3. Data processing system according to 
Qaim 2, wherein operands in said inter- 
mediate language are fanned by symbol table 
addresses. 

4. Data processing system according to any 



one of the preceding dainis,, whe£cm<*saidv-^^2^^^ 
programming means includes compiling means 
arranged, in operation^ to cause the data pro- 
cessing system to convert said source program 
from said intermediate language into a seccmd 
intermediate language which is a low level 30 
language. 

5. Data processing system according to 
Qaim 4, including means for storing said 
source prc^pram in said second intermediate 
language. 35 

6. Data processing system aocording to 
Qaims 4 or 5, wherein said programming 
means includes second translation means ar- 
ranged, in operation, to cause the data pro- 
cessing system to translate said source pro- 40 
gram from said second intermediate language 
into a machine language. 

7. Data processmg system substantially as 
hereinbefOTe descxib^ witli icfenence to the 
accompanying drawmgs. 45 

R. G. ROBINSON, 
Qiartered Patent Agent, 
Agent for the Applications. 
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